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SIMIAN ADENOVIRUS NUCLEIC ACID AND AMINO ACID SEQUENCES, 
VECTORS CONTAINING SAME, AND METHODS OF USE 

5 BACKGROUND OF THE INVENTION 

Adenovirus is a double-stranded DNA virus with a genome size of about 36 
kilobases (kb), which has been widely used for gene transfer applications due to its ability 
to achieve highly efficient gene transfer in a variety of target tissues and large transgene 
capacity. Conventionally, El genes of adenovirus are deleted and replaced with a 
transgene cassette consisting of the promoter of choice, cDNA sequence of the gene of 
interest and a poly A signal, resulting in a replication defective recombinant virus. 
Adenoviruses have a characteristic morphology with an icosahedral capsid 
consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV), 
along with a number of other minor proteins, VI, VIII, DC, Ilia and IVa2 [W.C. Russell, J. 
Gen Virol, 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded 
DNA with a terminal protein attached covalently to the 5' termini, which have inverted 
terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic 
protein VII and a small peptide termed mu. Another protein, V, is packaged with this 
DNA-protein complex and provides a structural link to the capsid via protein VI. The 
20 virus also contains a virus-encoded protease, which is necessary for processing of some of 
the structural proteins to produce mature infectious virus. 

Recombinant adenoviruses have been described for delivery of molecules to host 
cells. See, US Patent 6,083,716, which describes the genome of two chimpanzee 
adenoviruses. 

25 What is needed in the art are more effective vectors which avoid the effect of pre- 

existing immunity to selected adenovirus serotypes in the population and/or which are 
useful for repeat administration and for titer boosting by second vaccination, if required. 

Summary of the Invention 
30 The present invention provides the isolated nucleic acid sequences and amino acid 

sequences of six simian adenoviruses, vectors containing these sequences, and cell lines 
expressing simian adenovirus genes. Also provided are a number of methods for using 
the vectors and cells of the invention. 
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The methods of the invention involve delivering one or more selected 
heterologous gene(s) to a mammalian patient by administering a vector of the invention. 
Because the various vector constructs are derived from simian rather than from human 
adenoviruses, the immune system of the non-simian human or veterinary patient is not 

5 primed to respond immediately to the vector as a foreign antigen. Use of the 

compositions of this invention thus permits a more stable expression of the selected 
transgene when administered to a non-simian patient. Use of the compositions of this 
invention for vaccination permits presentation of a selected antigen for the elicitation of 
protective immune responses. Without wishing to be bound by theory, the ability of the 

10 adenoviruses of the invention to transduce human dendritic cells is at least partially 
responsible for the ability of the recombinant constructs of the invention to induce an 
immune response. The recombinant simian adenoviruses of this invention may also be 
used for producing heterologous gene products in vitro. Such gene products are 
themselves useful in a variety for a variety of purposes such as are described herein. 

1 5 These and other embodiments and advantages of the invention are described in 

more detail below. 

Brief Description of the Drawings 

Fig. 1 provides an alignment of the amino acid sequences of the U and a portion 

20 of the L2 loops of the capsid protein hexon of the chimpanzee adenovirus C 1 [SEQ ID 
NO:13], chimpanzee adenovirus C68 (Pan-9) [SEQ ID NO: 14], and the novel Pan5 [SEQ 
ID NO:15], Pan6 [SEQ ID NO: 16] and Pan7 [SEQ ID NO: 17] chimpanzee adenovirus 
sequences of the invention. The intervening conserved region is part of the pedestal 
domain conserved between adenovirus serotypes. 

2 5 Fig. 2 provides an alignment of the amino acid sequences of the fiber knob 

domains of chimpanzee C68 (Pan-9) [SEQ ID NO:l 8], Pan-6 [SEQ ID NO:19], Pan-7 
[SEQ ID NO:20], and Pan-5 [SEQ ID NO:21] and the human adenoviruses serotypes 2 
[SEQ ID NO:22] and 5 [SEQ ID NO:23]. 

30 DETAILED DESCRIPTION OF THE INVENTION 

The invention provides novel nucleic acid and amino acid sequences from Ad 
Pan5 [SEQ ID NO:l-4,15 and 21], Ad Pan6 [SEQ ID NO: 5-8, 16, 19], and Ad serotype 

2 
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Pan7 [SEQ ID NO: 9-12, 17, 20], which were originally isolated from chimpanzee lymph 
nodes. In several instances throughout the specification, these adenoviruses are 
alternatively termed herein C5, C6 and C7, respectively. Also provided are sequences 
from adenovirus SV1 [SEQ ID NO: 24-28], which was originally isolated from the 

5 kidney cells of cynomolgus monkey. The invention also provides sequences of 

adenoviruses SV-25 [SEQ IDNO:29-33] and SV-39 [SEQ ID NO: 34-37], which were 
originally isolated from rhesus monkey kidney cells. 

The present invention provides novel adenovirus vectors and packaging cell lines 
to produce those vectors for use in the in vitro production of recombinant proteins or 

10 fragments or other reagents. The invention further provides compositions for use in 

delivering a heterologous molecule for therapeutic or vaccine purposes. Such therapeutic 
or vaccine compositions contain the adenoviral vectors carrying an inserted heterologous 
molecule. In addition, novel sequences of the invention are useful in providing the 
essential helper functions required for production of recombinant adeno-associated viral 

15 (AAV) vectors. Thus, the invention provides helper constructs, methods and cell lines 
which use these sequences in such production methods. 

The term "substantial homology 1 * or "substantial similarity," when referring to a 
nucleic acid or fragment thereof, indicates that, when optimally aligned with appropriate 
nucleotide insertions or deletions with another nucleic acid (or its complementary strand), 

20 there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences. 

The term "substantial homology" or "substantial similarity," when referring to 
amino acids or fragments thereof, indicates that, when optimally aligned with appropriate 
amino acid insertions or deletions with another amino acid (or its complementary strand), 
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences. 

25 Preferably, the homology is over full-length sequence, or a protein thereof, or a fragment 
thereof which is at least 8 amino acids, or more desirably, at least 15 amino acids in 
length. Examples of suitable fragments are described herein. 

The term "percent sequence identity" or "identical" in the context of nucleic acid 
sequences refers to the residues in the two sequences that are the same when aligned for 

30 maximum correspondence. The length of sequence identity comparison may be over the 
full-length of the genome (e.g., about 36 kbp), the full-length of an open reading frame of 
a gene, protein, subunit, or enzyme [see, e.g., the tables providing the adenoviral coding 
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regions], or a fragment of at least about 500 to 5000 nucleotides, is desired. However, 
identity among smaller fragments, e.g. of at least about nine nucleotides, usually at least 
about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least about 36 or more 
nucleotides, may also be desired. Similarly, "percent sequence identity" may be readily 
5 determined for amino acid sequences, over the full-length of a protein, or a fragment 
thereof. Suitably, a fragment is at least about 8 amino acids in length, and may be up to 
about 700 amino acids. Examples of suitable fragments are described herein. 

Identity is readily determined using such algorithms and computer programs as 
are defined herein at default settings. Preferably, such identity is over the full length of 
io the protein, enzyme, subunit, or over a fragment of at least about 8 amino acids in length. 
However, identity may be based upon shorter regions, where suited to the use to which 
the identical gene product is being put. 

As described herein, alignments are performed using any of a variety of publicly 
or commercially available Multiple Sequence Alignment Programs, such as "Clustal W", 
15 accessible through Web Servers on the internet. Alternatively, Vector NT1 utilities are 
also used. There are also a number of algorithms known in the art that can be used to 
measure nucleotide sequence identity, including those contained in the programs 
described above. As another example, polynucleotide sequences can be compared using 
Fasta, a program in GCG Version 6.1. Fasta provides alignments and percent sequence 
20 identity of the regions of the best overlap between the query and search sequences. For 
instance, percent sequence identity between nucleic acid sequences can be determined 
using Fasta with its default parameters (a word size of 6 and the NOPAM factor for the 
scoring matrix) as provided in GCG Version 6.1, herein incorporated by reference. 
Similarly programs are available for performing amino acid alignments. Generally, these 
25 programs are used at default settings, although one of skill in the art can alter these 
settings as needed. Alternatively, one of skill in the art can utilize another algorithm or 
computer program that provides at least the level of identity or alignment as that provided 
by the referenced algorithms and programs. 

As used throughout this specification and the claims, the term "comprise" and its 
30 variants including, "comprises", "comprising", among other variants, is inclusive of other 
components, elements, integers, steps and the like. The term "consists of or "consisting 
of* are exclusive of other components, elements, integers, steps and the like. 
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I. The Simian Adenovirus Sequences 

The invention provides nucleic acid sequences and amino acid sequences of 
Pan5, Pan6, Pan7, SV1, SV25 and SV39, which are isolated from the other viral material 
5 with which they are associated in nature. 

A. Nucleic Acid Sequences 

The Pan5 nucleic acid sequences of the invention include nucleotides 1 to 
36462 of SEQ ID NO:l. The Pan6 nucleic acid sequences of the invention include 
nucleotides 1 to 36604 of SEQ ID NO: 5. The Pan7 nucleic acid sequences of the 

10 invention include nucleotides 1 to 36535 of SEQ ID NO: 9. The SV1 nucleic acid 

sequences of the invention include nucleotides 1 to 34264 of SEQ ID NO: 24. The SV25 
nucleic acid sequences of the invention include nucleotides 1 to 31044 of SEQ ID NO: 
29. The SV39 nucleic acid sequences of the invention include nucleotides 1 to 341 15 of 
SEQ ID NO: 34. See, Sequence Listing, which is incorporated by reference herein. 

1 5 The nucleic acid sequences of the invention further encompass the 

strand which is complementary to the sequences of SEQ ID NO: 5, 9, 24, 29 and 34, 
as well as the RNA and cDNA sequences corresponding to the sequences of these 
sequences figures and their complementary strands. Further included in this invention 
are nucleic acid sequences which are greater than 95 to 98%, and more preferably 

20 about 99 to 99.9% homologous or identical to the Sequence Listing. Also included in 
the nucleic acid sequences of the invention are natural variants and engineered 
modifications of the sequences provided in SEQ ID NO: 5, 9, 24, 29 and 34 and their 
complementary strands. Such modifications include, for example, labels that are 
known in the art, methylation, and substitution of one or more of the naturally 

2 5 occurring nucleotides with a degenerate nucleotide. 

The invention further encompasses fragments of the sequences of 
Pan5, Pan6, Pan7, SV1, SV25 and SV39, their complementary strand, cDNA and 
RNA complementary thereto. Suitable fragments are at least 15 nucleotides in length, 
and encompass functional fragments, i.e., fragments which are of biological interest. 

30 For example, a functional fragment can express a desired adenoviral product or may 
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be useful in production of recombinant viral vectors. Such fragments include the gene 
sequences and fragments listed in the tables below. 

The following tables provide the transcript regions and open reading frames 
in the simian adenovirus sequences of the invention. For certain genes, the transcripts 

5 and open reading fi imes (ORFs) are located on the strand complementary to that 
presented in SEQ ID NO: 5, 9, 24, 29 and 34. See, e.g., E2b, E4 and E2a. The 
calculated molecular weights of the encoded proteins are also shown. Note that the 
Ela open reading frame. Pan5 [nt 576-1436 of SEQ ID NO:l], Pan6 [nt 576 to 1437 of 
SEQ ID NO: 5) and Pan7 [nt 576 to 1437 of SEQ ID NO: 9] contain internal splice 

l o sites. These splice sites are noted in the following tables. 
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AdPan-5[SEQIDNO:l] 



Regions 


Start 
(nt) 


End 
(nt) 


M.W. 
(Daltons) 


ITR 


I 


120 


- 


Ela 


Transcript 


478 




- 


13S 


576-664, 


1233- 
1436 


28120 


12S 


576-1046, 1233- 
1436 


24389 


9S 


576-644,1233- 

1436 


9962 


Transcript 




ID 10 


— 


Elb 


Transcript 


1552 






Small T 


1599 


2171 


22317 


Large T 


1904 


3412 


55595 


IX 


3492 


3920 


1 14427 


Transcript 




3959 




E2b 


Transcript • 








PTP 




OH J 1 


72930 


Polymerase 


8448 


JvOj 


12/257 


IVa2 


5604 


3980 




Transcript 




3960 




28.1kl 


D 


5155 


5979 


2ol41 


Agnoprotein 


7864 


8580 


25755 


LI 


Transcript 


10849 




■ 


52/5 5D 


10851 


12025 




Ilia 


12050 


13819 


o5oo9 


Transcript 




13832 




Transcript 


13894 






L2 


Penton 


13898 


15490 




VII 


15494 


16078 


21478 


V 


16123 


17166 


39568 


Mu 


17189 


17422 


8524 


transcript 




17442 




Transcript 


17488 






L3 


VI 


17491 


18222 


26192 


Hexon 


18315 


21116 


104874 


Endoprotease 


20989 


21783 


28304 


transcript 




21811 
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Ad Pan-5 (cont'd) [SEQ ID NO:l] 



Regions 


OUU L 


Fnrl 


IVI. fr. 
fl*)nltnnv) 


E2a 


Transcript 


26782 






DBP 


23386 


21845 


57358 


transcript 




21788 




L4 


Transcript 


23406 






lQOkD 


23412 


25805 


88223 


33 kD nomolog 


25525 


26356 


24538 


\/TlI 

VlJl 


26428 


27111 


24768 


transcript 




27421 




E3 


Transcript 


26788 




- 


On #1 


27112 


27432 


12098 


Orf #2 


27386 


28012 


23040 


Orf if* 


27994 


28527 


19525 


Hrf &4 


28557 


29156 


22567 


Orf 




2y/o3 


2220/ 


Orf #6 




3U0/3 


3 14 jo 


Orf Ul 


jUOo I 




W4// 


Orf UR 




1 1 IDA 

3 135*0 


10023 


Orf ^0 


1 1 IRQ 


11 TO A 

j 1 / yo 


1 sua 






3 1 53 / 






1 ran scrips 


32032 






rioer 


32U3j 


55511 


47670 


transcript 




33443 




E4 


Transcript 


36135 






Orf7 


33710 


33462 


9191 


Orf6 


34615 


33710 


35005 


Orf4 


34886 


34521 


13878 


Orf3 


35249 


34896 


13641 


Orf2 


35635 


35246 


14584 


Orf 1 


36050 


35676 


13772 


Transcript 




33437 




ITR 




36343 


36462 
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AdPan-6 [SEQ ED NO: 5] 



Regions 


Start (nt) 


End 

(nt) 


M.W. 
(Daltons) 


ITR 


1 


123 




Ela 


transcript 


478 








I3S 


576-1143, 1229- 
1437 


28291 




12S 


576-1050, 1229- 
1437 


24634 




9S 


576-645, 1229- 
1437 


10102 




transcript 




1516 


- 


Elb 


transcript 


l j j j 




- 




Small T 


160A 


£m\ It* 


22315 




LargeT 


lysjj 


1& 1 1 


55594 




DC 


1AQQ 


3 726 


14427 




transcript • 




3965 


- 


E2b 


transcript 


10341 




- 




PTP 




OA < 1 

8451 


72570 




Polymerase 


R44S 




126907 




IVa2 




J7SO 


50452 




transcript 




jyOO 




LI 


transcript 


1 Uoio 




- 




52/55 kD 


! f\2A(\ 
1 Uo^fU 


1ZU \L 


• 44205 




Ilia 


i ZKJjO 




65460 




Transcript 




13812 


- 


28.1 kd 


5161 


5985 


28012 


Agnoprotein 


7870 


8580 


25382 


L2 


transcript 


13874 




- 




Penton 


13878 


15467 


59314 




VII 


15471 


16055 


21508 




V 


16100 


17137 


39388 




Mu 


17160 


17393 


8506 




transcript 




17415 


- 


L3 


transcript 


17466 








VI 


17469 


18188 


25860 




Hexon 


18284 


21112 


106132 




Endoprotease 


21134 


21754 


23445 




transcript * 




21803 




E2a 


transcript 


26780 








DBP 


23375 


21837 


57299 




transcript 




21780 
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AdPan-6 (cont'd) [SEQ ID NO:5] 



Regions 


Start (nt)" 


End 
(nt) 


M.W. 
(Daltom) 


T A 

1A 


Transcript 


23398 








lOOkD 


23404 


25806 


88577 




33 kD homolog 


25523 


26357 


24609 




vm 


26426 


27109 


24749 




transcript 




27419 


- 


E3 


transcript 


26786 




- 




Orf#l 


27110 


27430 


12098 




Orf#2 


27384 


28007 


22880 




Orf#3 


27989 


28519 


19460 




Orf#4 


28553 


29236 


25403 




Orf#5 ! 


29249 


29860 


22350 




Orf#6 


29875 


30741 


31028 




Orf#7 


30749 


31024 


10469 




Orf#8 


31030 


31464 


16540 




Orf#9 


31457 


31864 


'15264 




transcript 




31907 


- 


L5 


transcript 


32159 








Fiber 


32162 


33493 


47364 




transcript 




33574 


- 


E4 


transcript 


36276 




- 




Orf7 


33841 


33593 


9177 




Orf6 


34746 


33841 


35094 




Orf4 


35017 


34652 


13937 




Orf3 


35380 


35027 


13627 




Orf2 


35766 


35377 


14727 




Orf 1 


36181 


35807 


13739 




transcript 




33558 




ITR 




36482 


36604 
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AdPan-7 [SEQIDNO:9] 



Regions 


Start (nt) 


End(nt) 


M.W. (Daltons) 


ITR 




1 


132 




Ela 


transcript 


478 








13S 


576-1143, 1229-1437 


28218 




12S 


576- 1050, 1229- 1437 


24561 




9S 


576-645, 
1229-1437 


10102 




transcript 




1 51 A 
I MO 




Elb 


transcript 


1553 








Small T 


1600 


2178 






LargeT 


1905 


3419 






IVa2 


3992 


5616 






transcript 




3971 




E2b 


transcript 


10341 








FTP 


10340 


8457 


72297 




Polymerase 


8451 


5095 


1 26994 




IX 


3504 


3932 


14441 




transcript 




3972 




28.1kD 


5167 


5991 


28028 


Agnoprotein 


| 7876 


8586 


25424 


LI 


transcript 


10834 








52/55 kD 


10836 


12011 


44302 




Ilia 


12035 


13795 


65339 




transcript 




13808 


- 


L2 


transcript 


13870 




- 




Penton 


13874 


15469 


59494 




VII 


15473 


16057 


21339 




V 


16102 


17139 


39414 




Mu 


17167 


17400 


8506 




transcript 




17420 


- 


L3 


transcript 


17467 








VI 


17470 


18198 


26105 




Hexon 


18288 


21086 


104763 




Endoprotease 


21106 


21732 


23620 




transcript 




21781 




E2a 


transcript 


26764 








DBP 


23353 


21815 


j 57199 




transcript 




21755 
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Ad Pan-7 (cont'd) [SEQ ID NO: 9] 


Regions 


Start (nt) 


End (nt) 


M.W. (Daltons) 


L4 


transcript 


23370 




- 




lOOkD 


23376 


25781 


88520 




33 kD 
homolog 


25489 


26338 


25155 




VIII 


26410 


27093 


24749 




transcript 




27403 


- 


E3 


transcript 


26770 




m 




Orf#l 


27094 


27414 


12056 




Orf#2 


27368 


27988 


22667 




Orf#3 


27970 


28500 


19462 




Orf#4 


28530 


29150 


22999 




Orf#5 


29163 


29777 


22224 




Orf#6 


29792 


30679 


32153 




Orf#7 


30687 


30962 


10511 




Orf#8 


30968 


31399 


16388 




Orf#9 


31392 


31799 


15205 




transcript 




31842 


- 


L5 


transcript 


32091 




- 




Fiber 


32094 


33425 


47344 




transcript 




33517 


- 


I E4 


transcript 


36208 








Orf7 


33784 


33536 


9191 




Orf6 


34689 


33784 


35063 




Orf4 


34960 


34595 


13879 




Orf3 


35323 


34970 


13641 




Orf2 


35709 


35320 


14644 




Orf 1 


36123 


35749 


13746 




transcript 




33501 




ITR 




36404 


36535 


m 
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AdSV-1 




Ad SV-25 


AdSV-39 




[SEQIDNO:24] 


[SEQIDNO:29] 


[SEQ ID NO: 34] 


Region 


Start 


End 


Start 


End 


Start 


End 


ITR 


1 


106 


1 


133 


1 


150 


Ela 


352 


1120 


- 


- 


404 


1409 


Elb 


1301 


2891 


359 


2273 


1518 


3877 


E2b 


9257 


2882 


9087 


2754 


10143 


3868 


E2a 


24415 


20281 


24034 


20086 


25381 


21228 


E3 


24974 


27886 


24791 


25792 


25790 


29335 


E4 


33498 


30881 


30696 


28163 


33896 


31157 


ITR 


34145 


34264 


30912 


31044 


33966 


34115 






AdSV-1 




Ad SV-25 




Ad SV-39 






[SEQIDNO:24] 


[SEQIDNO:29] 


[SEQ ID NO: 34] 


Region 


Start 


End 


Start 


End 


Start 


End 


ITR 


1 


106 


1 


133 


1 


150 


LI 


9513 


12376 


9343 


12206 


10416 


13383 


12 


12453 


15858 


12283 


15696 


13444 


16877 


L3 


15910 


20270 


15748 


20080 


17783 


21192 


L4 


21715 


25603 


21526 


25420 


22659 


26427 


L5 


28059 


30899 


25320 


28172 


29513 


31170 


ITR 


34145 


34264 


30912 


31044 


33966 


34115 



13 



WO 03/046124 



PCT/US02/33645 





protein 


Ad SV-l,SEQIDNO: 24 






Start 


End 


M.W. 


ITR 




1 


106 


- 


Ela 


13S 


459 


953 


18039 


12S 








Elb 


Small T 








LargeT 


1301 


2413 


42293 


IX 


2391 


2885 


16882 


E2b 


IVa2 


4354 


2924 


54087 


Polymerase 


6750 


4027 


102883 


FTP 


9257 


7371 


72413 




Agno-protein 


6850 


7455 


20984 


LI 


52/55 kD 


9515 


10642 


42675 


Ilia 


10663 


12372 


636568 


L2 


Penton 


12454 


13965 


56725 


vn 


13968 


14531 


20397 


V 


14588 


15625 


39374 


Mu 


15645 


15857 


7568 


L3 


VI 


15911 


16753 


30418 


Hexon 


16841 


19636 


104494 


Endoprotease 


19645 


20262 


23407 


2a 


DBP 


21700 


20312 


52107 


L4 


lOOkD 


21721 


24009 


85508 


VIII 


24591 


25292 


25390 
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protein 


Ad SV-1 (cont'd) 
SEQIDNO:24 






Start 


End 


U.W. 


E3 


Orf#l 


25292 


25609 


11950 




Orf#2 


25563 


26081 


18940 




Orf#3 


26084 


26893 


30452 




Orf#4 


26908 


27180 


10232 




Orf#5 


27177 


17512 


12640 




Orf#6 


27505 


27873 


13639 


L5 


Fiber #2 


28059 


29150 


39472 




Fiber #1 


29183 


30867 


61128 


E4 


Orf7 


31098 


30892 


7837 




Orf6 


31982 


31122 


33921 




Orf4 


32277 


31915 


14338 




Orf3 


32629 


32279 


13386 




Orf2 


33018 


32626 


14753 




Orf 1 


33423 


33043 


14301 


ITR 




34145 


34264 
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protein 


Ad SV-25, SEQIDNO:29 


AdSV-39, SEQIDNO:34 






Start 


End 


M.M. 


Start 


End 


MW. 


ITR 




1 


133 


- 


1 


150 


- 


Ela 


13S 








492 


1355 


28585 


12S ' 








492 


1355 


25003 


Elb 


Small T 


478 


1030 


20274 


1518 


2075 


21652 | 


Large T 


829 


2244 


52310 


1823 


3349 


55534 


K 


2306 


2716 


13854 


3434 


3844 


14075 


E2b 


IVa2 


4208 


2755 


54675 


3912 


5141 


46164 


Poly- 
merase 


6581 


3858 


102839 


7753 


5033 


103988 


FTP 


9087 


7207 1 


71326 


10143 


8335 


69274 




Agno- 
protein 


6681 


7139 


16025 


- 


- 


- 


LI 


52/55 kD 


9345 


10472 


42703 


10418 


11608 


44232 


Ilia 


10493 


12202 


63598 


11574 


13364 


66078 


L2 


Penton 


12284 


13801 


56949 


13448 


14959 


56292 


VII 


13806 


14369 


20369 


14960 


15517 


20374 


V 


14426 


15463 


39289 


15567 


16628 


39676 ! 


Mu 


15483 


15695 


7598 


16650 


16871 


7497 


L3 


VI 


15749 


16591 


30347 


16925 


17695 


28043 


Hexon 


16681 


19446 


104035 


17785 


20538 


102579 


Endo- 
protease 


19455 


20072 


23338 


20573 


21181 


22716 


2a 


DBP 


215.11 


20123 


52189 


22631 


21231 


53160 


L4 


lOOkD 


21532 


23829 


85970 


22659 


25355 


100362 


VIII 


24408 


25109 


25347 


25410 


26108 


25229 
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protein 


AdSV-25, SEQIDNO:29 
(cont'd) 


Ad SV-39, SEQIDNO:34, 
(cont'd) 






Start . 


End 


M.W. 


Start 


End 


M.W. 


E3 


Orf#l 


25109 


25426 


11890 


26375 


27484 


42257 


Orf#2 








27580 


28357 


29785 \ 


Orf#3 








28370 


28645 


10514 


Orf#4 








28863 


29333 


18835 


Orf#5 


• 












Orf#6 














L5 


Fiber #2 


25380 


26423 


37529 










Fiber #1 


26457 


28136 


60707 


29515 


31116 


56382 


E4 


Orf7 








31441 


31118 


11856 


Orf6 


29255 


28395 


33905 


32292 


31438 


33437 


Orf4 


29550 


29188 


14399 


32587 


32222 


13997 


Orf3 


29902 


29552 


13284 


32954 


32607 


13353 


Orf2 


30291 


29899 


14853 


33348 


32959 


14821 


Orf 1 


30316 


30696 


14301 


33764 


33378 


14235 


ITR 




30912 


31044 




33966 


34115 





The Pan5, Pan6, Pan7, SV I, SV25 and SV39 adenoviral nucleic acid 
5 sequences are useful as therapeutic agents and in construction of a variety of vector 
systems and host cells. As used herein, a vector includes any suitable nucleic acid 
molecule including, naked DNA, a plasmid, a virus, a cosmid, or an episome. These 
sequences and products may be used alone or in combination with other adenoviral 
sequences or fragments, or in combination with elements from other adenoviral or non- 
10 adenoviral sequences. The adenoviral sequences of the invention are also useful as 

antisense delivery vectors, gene therapy vectors, or vaccine vectors. Thus, the invention 
further provides nucleic acid molecules, gene delivery vectors, and host cells which 
contain the Ad sequences of the invention. 



17 



WO 03/046124 



PCT/US02/3364S 



For example, the invention encompasses a nucleic acid molecule 
containing simian Ad ITR sequences of the invention. In another example, the invention 
provides a nucleic acid molecule containing simian Ad sequences of the invention 
encoding a desired Ad gene product. Still other nucleic acid molecule constructed using 
5 the sequences of the invention will be readily apparent to one of skill in the art, in view of 
the information provided herein. 

In one embodiment, the simian Ad gene regions identified herein may be 
used in a variety of vectors for delivery of a heterologous molecule to a cell. For 
example, vectors are generated for expression of an adenoviral capsid protein (or 
l o fragment thereof) for purposes of generating a viral vector in a packaging host cell. Such 
vectors may be designed for expression in trans. Alternatively, such vectors are designed 
to provide cells which stably contain sequences which express desired adenoviral 
functions, e.g., one or more of Ela, Elb, the terminal repeat sequences, E2a, E2b, E4, 
E40RF6 region. 

15 In addition, the adenoviral gene sequences and fragments thereof are 

useful for providing the helper functions necessary for production of helper-dependent 
viruses (e.g., adenoviral vectors deleted of essential functions or adeno-associated viruses 
(AAV)). For such production methods, the simian adenoviral sequences of the invention 
are utilized in such a method in a manner similar to those described for the human Ad. 

20 However, due to the differences in sequences between the simian adenoviral sequences of 
the invention and those of human Ad, the use of the sequences of the invention essentially 
eliminate the possibility of homologous recombination with helper functions in a host cell 
carrying human Ad El functions, e.g., 293 cells, which may produce infectious 
adenoviral contaminants during rAAV production. 

25 Methods of producing rAAV using adenoviral helper functions have been 

described at length in the literature with human adenoviral serotypes. See, e.g., US Patent 
6,258,595 and the references cited therein. See, also, US Patent 5,871,982; WO 
99/14354; WO 99/15685; WO 99/47691. These methods may also be used in production 
of non-human serotype AAV, including non-human primate AAV serotypes. The simian 

30 adenoviral gene sequences of the invention which provide the necessary helper functions 
(e.g., Ela, Elb, E2a and/or E4 ORF6) can be particularly useful in providing the 
necessary adenoviral function while minimizing or eliminating the possibility of 
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recombination with any other adenoviruses present in the rAAV-packaging cell which are 
typically of human origin. Thus, selected genes or open reading frames of the adenoviral 
sequences of the invention may be utilized in these rAA V production methods. 

Alternatively, recombinant adenoviral simian vectors of the invention may 
5 be utilized in these methods. Such recombinant adenoviral simian vectors may include, 
e.g., a hybrid chimp Ad/AAV in which chimp Ad sequences flank a rAAV expression 
cassette composed o£ e.g., AAV 3' and/or 5* ITRs and a transgene under the control of 
regulatory sequences which control its expression. One of skill in the art will recognize 
that still other simian adenoviral vectors and/or gene sequences of the invention will be 
useful for production of rAAV and other viruses dependent upon adenoviral helper. 

In still another embodiment, nucleic acid molecules are designed for 
delivery and expression of selected adenoviral gene products in a host cell to achieve a 
desired physiologic effect. For example, a nucleic acid molecule containing sequences 
encoding an adenovirus Ela protein of the invention may be delivered to a subject for use 
as a cancer therapeutic. Optionally, such a molecule is formulated in a lipid-based carrier 
and preferentially targets cancer cells. Such a formulation may be combined with other 
cancer therapeutics (e.g., cisplatin, taxol, or the like). Still other uses for the adenoviral 
sequences provided herein will be readily apparent to one of skill in the art. 

In addition, one of skill in the art will readily understand that the Ad 
sequences of the invention can be readily adapted for use for a variety of viral and non- 
viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
immunogenic molecules. For example, the Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 
simian Ad genomes of the invention can be utilized in a variety of rAd and non-rAd 
vector systems. Such vectors systems may include, e.g., plasmids, lentiviruses, 
retroviruses, poxviruses, vaccinia viruses, and adeno-associated viral systems, among 
others. Selection of these vector systems is not a limitation of the present invention! 

The invention further provides molecules useful for production of the 
simian and simian-derived proteins of the invention. Such molecules which carry 
polynucleotides including the simian Ad DNA sequences of the invention can be in the 
form of naked DNA, a plasmid, a virus or any other genetic element. 
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B. Simian Adenoviral Proteins of the Invention 

The invention further provides gene products of the above adenoviruses, 
such as proteins, enzymes, and fragments thereof, which are encoded by the adenoviral 
nucleic acids of the invention. The invention further encompasses Pan5, Pan6 and Pan7, 

5 SV1, SV25 and SV39 proteins, enzymes, and fragments thereof, having the amino acid 
sequences encoded by these nucleic acid sequences which are generated by other 
methods. Such proteins include those encoded by the open reading frames identified in 
the tables above, in Figs. 1 and 2, and fragments thereof. 

Thus, in one aspect, the invention provides unique simian adenoviral 

10 proteins which are substantially pure, i.e., are free of other viral and proteinaceous 

proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 60% 
homogeneous, and most preferably 95% homogeneous. 

In one embodiment, the invention provides unique simian-derived capsid 
proteins. As used herein, a simian-derived capsid protein includes any adenoviral capsid 

15 protein that contains a Pan5, Pan6, Pan7, SV1 , SV25 or SV39 capsid protein or a 
fragment thereof, as defined above, including, without limitation, chimeric capsid 
proteins, fusion proteins, artificial capsid proteins, synthetic capsid proteins, and 
recombinantly capsid proteins, without limitation to means of generating these proteins. 

Suitably, these simian-derived capsid proteins contain one or more Pan5, 

20 Pan6, Pan7, SV1, SV25 or SV39 regions or fragments thereof (e.g., a hexon, penton, fiber 
or fragment thereof) in combination with capsid regions or fragments thereof of different 
adenoviral serotypes, or modified simian capsid proteins or fragments, as described 
herein. A "modification of a capsid protein associated with altered tropism" as used 
herein includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or 

25 fragment thereof, such as the knob domain of the fiber region, or a polynucleotide 
encoding same, such that specificity is altered. The simian-derived capsid may be 
constructed with one or more of the simian Ad of the invention or another Ad serotypes 
which may be of human or non-human origin. Such Ad may be obtained from a variety of 
sources including the ATCC, commercial and academic sources, or the sequences of the 

30 Ad may be obtained from GenBank or other suitable sources. 

The amino acid sequences of the simian adenoviruses penton proteins of 
the invention are provided herein. The AdPanS penton protein is provided in SEQ ID 
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NO:2. The AdPan7 penton is provided in SEQ ID NO:6. The AdPan6 penton is 
provided in SEQ ID NO:10. The SV1 penton is provided in SEQ ID NO:25. The SV25 
penton protein is provided in SEQ ED NO:30. The SV39 penton is provided in SEQ ID 
NO:35. Suitably, any of these penton proteins, or unique fragments thereof, may be 
5 utilized for a variety of purposes. Examples of suitable fragments include the penton 
having N-terminal a;id/or C-terminal truncations of about 50, 100, 150, or 200 amino 
acids, based upon the amino acid numbering provided above and in SEQ ID NO:2; SEQ 
ID NO:6; SEQ ID NO:25; SEQ ID NO:30, or SEQ ID NO:35. Other suitable fragments 
include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein 

10 may be modified for a variety of purposes known to those of skill in the art. 

The invention further provides the amino acid sequences of the hexon 
protein of Pan5 [SEQ ID NO:3], Pan6 [SEQ ID NO:7], Pan 7 [SEQ ID NO:l 1], SV1 
[SEQ ID NO:26], SV25 [SEQ ID NO:31], and/or SV39 [SEQ ID NO:36]. Suitably, this 
hexon protein, or unique fragments thereof, may be utilized for a variety of purposes. 

15 Examples of suitable fragments include the hexon having N-terminal and/or C-terminal 
truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the 
amino acid numbering provided above and in SEQ ID NO: 3, 7, 1 1, 26, 31 and 36. Other 
suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For 
example, one suitable fragment the loop region (domain) of the hexon protein, designated 

20 DEI and FG1, or a hypervariable region thereof. Such fragments include the regions 
spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments, 
such as those spanning about residue 1 38 to residue 163; about 170 to about 176; about 
195 to about 203; about 233 to about 246; about 253 to about 264; about 287 to about 
297; and about 404 to about 430 of the simian hexon proteins, with reference to SEQ ID 

25 NO: 3, 7, 1 1, 26, 31 or 36. Other suitable fragments may be readily identified by one of 
skill in the art. Further, the hexon protein may be modified for a variety of purposes 
known to those of skill in the art. Because the hexon protein is the determinant for 
serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses 
having artificial serotypes. Other artificial caps id proteins can also be constructed using 

30 the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments 
thereof. 
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In one example, it may be desirable to generate an adenovirus having an 
altered hexon protein utilizing the sequences of a hexon protein of the invention. One 
suitable method for altering hexon proteins is described in US Patent 5,922,315, which is 
incorporated by reference. In this method, at least one loop region of the adenovirus 
5 hexon is changed with at least one loop region of another adenovirus serotype. Thus, at 
least one loop region of such an altered adenovirus hexon protein is a simian Ad hexon 
loop region of the invention (e.g. Pan7). In one embodiment, a loop region of the Pan7 
hexon protein is replaced by a loop region from another adenovirus serotype. In another 
embodiment, the loop region of the Pan7 hexon is used to replace a loop region from 

10 another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from 
among human and non-human serotypes, as described herein, Pan7 is selected for 
purposes of illustration only; the other simian Ad hexon proteins of the invention may be 
similarly altered, or used to alter another Ad hexon. The selection of a suitable serotype 
is not a limitation of the present invention. Still other uses for the hexon protein 

15 sequences of the invention will be readily apparent to those of skill in the art. 

The invention further encompasses the fiber proteins of the simian 
adenoviruses of the invention. The fiber protein of AdPan 5 has the amino acid sequence 
of SEQ ID NO:4. The fiber protein AdPan6 has the amino acid sequence of SEQ ID NO: 
8. The fiber protein of AdPan7 has the amino acid sequence of SEQ ID NO: 12. SV-1 

20 has two fiber proteins; fiber 2 has the amino acid sequence of SEQ ID NO:27 and fiber 1 
has the amino acid sequence of SEQ ID NO:28. SV-25 also has two fiber proteins; fiber 
2 has the amino acid sequence of SEQ ID NO:32 and fiber 1 has the amino acid sequence 
of SEQ ID NO:33. The fiber protein of SV-39 has the amino acid sequence of SEQ ID 
NO:37. 

25 Suitably, this fiber protein, or unique fragments thereof, may be utilized 

for a variety of purposes. One suitable fragment is the fiber knob, which spans about 
amino acids 247 to 425 of SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. See Fig. 2. Examples 
of other suitable fragments include the fiber having N-terminal and/or C-terminal 
truncations of about 50, 100, 1 50, or 200 amino acids, based upon the amino acid 

30 numbering provided above and in SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. Still other 
suitable fragments include internal fragments. Further, the fiber protein may be modified 
using a variety of techniques known to those of skill in the art. 
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The invention further encompasses unique fragments of the proteins of the 
invention which are at least 8 amino acids in length. However, fragments of other desired 
lengths can be readily utilized. In addition, the invention encompasses such 
modifications as may be introduced to enhance yield and/or expression of a Pan5, Pan6, 
5 Pan7, SV1, SV25 or SV39 gene product, e.g., construction of a fusion molecule in which 
all or a fragment of the Pan5, Pan6, Pan7, SV1, SV25 or SV39 gene product is fused 
(either directly or via a linker) with a fusion partner to enhance. Other suitable 
modifications include, without limitation, truncation of a coding region (e.g., a protein or 
enzyme) to eliminate a pre- or pro-protein ordinarily cleaved and to provide the mature 

10 protein or enzyme and/or mutation of a coding region to provide a secretable gene 

product. Still other modifications will be readily apparent to one of skill in the art The 
invention further encompasses proteins having at least about 95% to 99% identity to the 
Pan5, Pan6, Pan7, SV1, SV25 or SV39 proteins provided herein. 

As described herein, vectors of the invention containing the adenoviral 

15 capsid proteins of the invention are particularly well suited for use in applications in 
which the neutralizing antibodies diminish the effectiveness of other Ad serotype based 
vectors, as well as other viral vectors. The rAd vectors of the invention are particularly 
advantageous in readministration for repeat gene therapy or for boosting immune 
response (vaccine titers). 

20 Under certain circumstances, it may be desirable to use one or more of the 

Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 gene products (e.g., a capsid protein or a 
fragment thereof) to generate an antibody. The term "an antibody," as used herein, refers 
to an immunoglobulin molecule which is able to specifically bind to an epitope. Thus, 
the antibodies of the invention bind, preferably specifically and without cross-reactivity, 

25 to a Pan5, Pan6, Pan7, SV1 , SV25 or SV39 epitope. The antibodies in the present 
invention exist in a variety of forms including, for example, high affinity polyclonal 
antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant 
antibodies and humanized antibodies. Such antibodies originate from immunoglobulin 
classes IgG, IgM, IgA, IgD and IgE. 

30 Such antibodies may be generated using any of a number of methods 

know in the art Suitable antibodies may be generated by well-known conventional 
techniques, e.g. Kohler and Milstein and the many known modifications thereof. 
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Similarly desirable high titer antibodies are generated by applying known recombinant 
techniques to the monoclonal or polyclonal antibodies developed to these antigens [see, 
e.g., PCT Patent Application No. PCT/GB85/00392; British Patent Application 
Publication No. GB2188638A; Amit et al, 1986 Science, 233:147-753; Queen el al, 
5 1989 Proc. Nat'l. Acad. Sci. USA, 55:10029-10033; PCT Patent Application No. 

PCTYWO9007861; and Riechmann et al, Nature, 532:323-327 (1988); Huse et al, 1988a 
Science, 24*6": 1275- 1281]. Alternatively, antibodies can be produced by manipulating the 
complementarity determining regions of animal or human antibodies to the antigen of this 
invention. See, e.g., E. Mark and Padlin, " Human ization of Monoclonal Antibodies", 
l o Chapter 4, The Handbook of Experimental Pharmacology, Vol. 1 1 3, The Pharmacology 
of Monoclonal Antibodies, Springer-Verlag (June, 1 994);. Harlow et al, 1999, Using 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et 
al, 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; Houston et 
al, 1988, Proc. Natl. Acad. Sci. USA 55:5879-5883; and Bird el al, 1988, Science 
15 24*2:423-426. Further provided by the present invention are anti-idiotype antibodies 
(Ab2) and anti-anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al., 
"Modulation of anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network 
and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol., Washington 
DC: pp. 203-229]. These anti-idiotype and anti-anti-idiotype antibodies are produced 
20 using techniques well known to those of skill in the art. These antibodies may be used for 
a variety of purposes, including diagnostic and clinical methods and kits. 

Under certain circumstances, it may be desirable to introduce a detectable 
label or a tag onto a Pan5, Pan6, Pan7, SV1, SV25 or SV39 gene product, antibody or 
other construct of the invention. As used herein, a detectable label is a molecule which is 
25 capable, alone or upon interaction with another molecule, of providing a detectable signal. 
Most desirably, the label is detectable visually, e.g. by fluorescence, for ready use in 
immunohistochemical analyses or immunofluorescent microscopy. For example, suitable 
labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin 
(APC), coriphosphine-0 (CPO) or tandem dyes, PE-cyanin-5 (PCS), and PE-Texas Red 
30 (ECD). All of these fluorescent dyes are commercially available, and their uses known to 
the ait Other useful labels include a colloidal gold label. Still other useful labels include 
radioactive compounds or elements. Additionally, labels include a variety of enzyme 

24 
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systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase 
(which uses glucose as a substrate) releases peroxide as a product which in the presence 
of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an 
oxidized TMB that is seen as a blue color. Other examples include horseradish 
5 peroxidase (HRP) or alkaline phosphatase (AP), and hexokinase in conjunction with 

glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, 
among other products, NADH that is detected as increased absorbance at 340 nm 
wavelength. 

Other label systems that are utilized in the methods of this invention are 

10 detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, 

Indiana] in which a dye is embedded are used in place of enzymes to form conjugates 
with the target sequences provide a visual signal indicative of the presence of the 
resulting complex in applicable assays. 

Methods for coupling or associating the label with a desired molecule are 

15 similarly conventional and known to those of skill in the art. Known methods of label 
attachment are described [see, for example, Handbook of Fluorescent probes and 
Research Chemicals, 6th Ed., R, P. M, Haugland, Molecular Probes, Inc., Eugene, OR, 
1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, 
Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, selection of the label and 

20 coupling methods do not limit this invention. 

The sequences, proteins, and fragments of the invention may be produced 
by any suitable means, including recombinant production, chemical synthesis, or other 
synthetic means. Suitable production techniques are well known to those of skill in the 
art See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring 

25 Harbor Press (Cold Spring Harbor, NY). Alternatively, peptides can also be synthesized 
by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc, 
85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San 
Francisco, 1969) pp. 27-62). These and other suitable production methods are within the 
knowledge of those of skill in the art and are not a limitation of the present invention. 

30 In addition, one of skill in the art will readily understand that the Ad 

sequences of the invention can be readily adapted for use for a variety of viral and non- 
viral vector systems for in vitro, ex v/vo or in vivo delivery of therapeutic and 

25 
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immunogenic molecules. For example, in one embodiment, the simian Ad capsid 
proteins and other simian adenovirus proteins described herein are used for non-viral, 
protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and 
immunogenic molecules. In one such embodiment, a protein of the invention is linked, 

5 directly or indirectly, to a molecule for targeting to cells with a receptor for adenoviruses. 
Preferably, a capsid protein such as a hexon, penton, fiber or a fragment thereof having a 
ligand for a cell surface receptor is selected for such targeting. Suitable molecules for 
delivery are selected from among the therapeutic molecules described herein and their 
gene products. A variety of linkers including, lipids, polyLys, and the like may be 

10 utilized as linkers. For example, the simian penton protein may be readily utilized for 
such a purpose by production of a fusion protein using the simian penton sequences in a 
manner analogous to that described in Medina-Kauwe LK, et al, Gene Ther, 2001 May; 
8(10):795-803 and Medina-Kauwe LK, et al, Gene Ther. 2001 Dec; 8(23): 1753-1761. 
Alternatively, the amino acid sequences of simian Ad protein IX may be utilized for 

15 targeting vectors to a cell surface receptor, as described in US Patent Appln 

20010047081. Suitable ligands include aCD40 antigen, an RGD-containing or 
polylysine-containing sequence, and the like. Still other simian Ad proteins, including, 
e.g., the hexon protein and/or the fiber protein, may be used for used for these and similar 
purposes. 

20 Still other adenoviral proteins of the invention may be used as alone, or in 

combination with other adenoviral protein, for a variety of purposes which will be readily 
apparent to one of skill in the art. In addition, still other uses for the adenoviral proteins 
of the invention will be readily apparent to one of skill in die art. 

25 EL Recombinant Adenoviral Vectors 

The compositions of this invention include vectors that deliver a heterologous 
molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector 
may include any genetic element including, without limitation, naked DNA, a phage, 
transposon, cosmid, episome, plasmid, or a virus. Such vectors contain simian 

30 adenovirus DNA of Pan5, Pan6, Pan7, SV1 , SV25 and/or SV39 and a minigene. By 
"minigene" is meant the combination of a selected heterologous gene and the other 
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regulatory elements necessary to drive translation, transcription ancVor expression of the 
gene product in a host cell. 

Typically, an adenoviral vector of the invention is designed such that the 
minigene is located in a nucleic acid molecule which contains other adenoviral sequences 
5 in the region native to a selected adenoviral gene. The minigene may be inserted into an 
existing gene region to disrupt the function of that region, if desired. Alternatively, the 
minigene may be inserted into the site of a partially or fully deleted adenoviral gene. For 
example, the minigene may be located in the site of such as the site of a functional El 
deletion or functional E3 deletion, among others that may be selected. The term 

1 0 "functionally deleted" or "functional deletion" means that a sufficient amount of the gene 
region is removed or otherwise damaged, e.g., by mutation or modification, so that the 
gene region is no longer capable of producing functional products of gene expression. If 
desired, the entire gene region may be removed. Other suitable sites for gene disruption or 
deletion are discussed elsewhere in the application. 

15 For example, for a production vector useful for generation of a recombinant virus, 

the vector may contain the minigene and either the 5* end of the adenoviral genome or the 
3* end of the adenoviral genome, or both the 5' and 3* ends of the adenoviral genome. 
The 5' end of the adenoviral genome contains the 5 7 cis-elements necessary for packaging 
and replication; i.e., the 5 5 inverted terminal repeat (ITR) sequences (which functions as 

2 0 origins of replication) and the native 5' packaging enhancer domains (that contain 

sequences necessary for packaging linear Ad genomes and enhancer elements for the El 
promoter). The 3' end of the adenoviral genome includes the 3' cis-elements (including 
the ITRs) necessary for packaging and encapsidation. Suitably, a recombinant adenovirus 
contains both 5* and 3' adenoviral cis-elements and the minigene is located between the 

25 5' and 3* adenoviral sequences. Any adenoviral vector of the invention may also contain 
additional adenoviral sequences. 

Suitably, these adenoviral vectors of the invention contain one or more adenoviral 
elements derived from an adenoviral genome of the invention. In one embodiment, the 
vectors contain adenoviral ITRs from Pan5, Pan6, Pan7, SV1, SV25 or SV39 and 

30 additional adenoviral sequences from the same adenoviral serotype. In another 

embodiment, the vectors contain adenoviral sequences that are derived from a different 
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adenoviral serotype than that which provides the ITRs. As defined herein, a pseudotyped 
adenovirus refers to an adenovirus in which the capsid protein of the adenovirus is from a 
different serotype than the serotype which provides the ITRs. The selection of the 
serotype of the ITRs and the serotype of any other adenoviral sequences present in vector 

5 is not a limitation of the present invention. A variety of adenovirus strains are available 
from the American Type Culture Collection, Manassas, Virginia, or available by request 
from a variety of commercial and institutional sources. Further, the sequences of many 
such strains are available from a variety of databases including, e.g., PubMed and 
GenBank. Homologous adenovirus vectors prepared from other simian or from human 

10 adenoviruses are described in the published literature [see, for example, US Patent No, 
5,240,846]. The DNA sequences of a number of adenovirus types are available from 
GenBank, including type Ad5 [GenBank Accession No. M73260]. The adenovirus 
sequences may be obtained from any known adenovirus serotype, such as serotypes 2, 3, 
4, 7, 12 and 40, and further including any of the presently identified human types. 

15 Similarly adenoviruses known to infect non-human animals (e.g., simians) may also be 
employed in the vector constructs of this invention. See, e.g., US Patent No. 6,083,716. 

The viral sequences, helper viruses, if needed, and recombinant viral particles, 
and other vector components and sequences employed in the construction of the vectors 
described herein are obtained as described above. The DNA sequences of the Pan5, Pan6, 

20 Pan7, SV1, SV25 and/or SV39 simian adenovirus sequences of the invention are 

employed to construct vectors and cell lines useful in the preparation of such vectors. 

Modifications of the nucleic acid sequences forming the vectors of this invention, 
including sequence deletions, insertions, and other mutations may be generated using 
standard molecular biological techniques and are within the scope of this invention. 

25 it The "Minigene" 

The methods employed for the selection of the transgene, the cloning and 
construction of the "minigene" and its insertion into the viral vector are within the skill in 
the art given the teachings provided herein. 

1. The transgene 

30 The transgene is a nucleic acid sequence, heterologous to the 

vector sequences flanking the transgene, which encodes a polypeptide, protein, or other 
product, of interest. The nucleic acid coding sequence is operatively linked to regulatory 
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components in a manner which permits transgene transcription, translation, and/or 
expression in a host cell. 

The composition of the transgene sequence will depend upon the 
use to which the resulting vector will be put For example, one type of transgene 

5 sequence includes a reporter sequence, which upon expression produces a detectable 
signal. Such reporter sequences include, without limitation, DNA sequences encoding P- 
lactamase, p-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green 
fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, 
membrane bound proteins including, for example, CD2, CD4, CD8, the influenza 

10 hemagglutinin protein, and others well known in the art, to which high affinity antibodies 
directed thereto exist or can be produced by conventional means, and fusion proteins 
comprising a membrane bound protein appropriately fused to an antigen tag domain from, 
among others, hemagglutinin or Myc. These coding sequences, when associated with 
regulatory elements which drive their expression, provide signals detectable by 

15 conventional means, including enzymatic, radiographic, colorimetric, fluorescence or 
other spectrograph ic assays, fluorescent activating cell sorting assays and immunological 
assays, including enzyme linked immunosorbent assay (ELIS A), radioimmunoassay 
(RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ 
gene, the presence of the vector carrying the signal is detected by assays for beta- 

20 galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying the 
signal may be measured visually by color or light production in a luminometer. 

However, desirably, the transgene is a non-marker sequence encoding a 
product which is useful in biology and medicine, such as proteins, peptides, RNA, 
enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, 

25 ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA 
sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence 
in the treated animal. 

The transgene may be used for treatment, e.g., of genetic deficiencies, as a 
cancer therapeutic or vaccine, for induction of an immune response, and/or for 

30 prophylactic vaccine purposes. As used herein, induction of an immune response refers 
to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral 
immune response to the molecule. The invention further includes using multiple 
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transgenes, e.g., to correct or ameliorate a condition caused by a multi-subimit protein. In 
certain situations, a different transgene may be used to encode each subunit of a protein, 
or to encode different peptides or proteins. This is desirable when the size of the DNA 
encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived 
5 growth factor, or a lystrophin protein. In order for the cell to produce the multi-subunit 
protein, a eel! is infected with the recombinant virus containing each of the different 
subunits. Alternatively, different subunits of a protein may be encoded by the same 
transgene. In this case, a single transgene includes the DNA encoding each of the 
subunits, with the DNA for each subunit separated by an internal ribozyme entry site 

10 (IRES). This is desirable when the size of the DNA encoding each of the subunits is 

small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five 
kilobases. As an alternative to an IRES, the DNA may be separated by sequences 
encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L. 
Donnelly, et al, J. Gen. Virol, 78(Pt I): 13-21 (Jan 1997); Furler, S., etal, Gene Then, 

15 8(1 1):864-873 (June 2001); Klump H., etal, Gene Ther., 8(10):81 1-817 (May 2001). 
This 2A peptide is significantly smaller than an IRES, making it well suited for use when 
space is a limiting factor. However, the selected transgene may encode any biologically 
active product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in the art. The 

20 selection of die transgene is not considered to be a limitation of this invention. 
2. Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the vector also includes conventional control elements necessary which are 
operably linked to the transgene in a manner that permits its transcription, translation 

25 and/or expression in a eel! transfected with the plasmid vector or infected with the virus 
produced by the invention. As used herein, "operably linked" sequences include both 
expression control sequences that are contiguous with the gene of interest and expression 
control sequences that act in trans or at a distance to control the gene of interest. 

Expression control sequences include appropriate transcription 

30 initiation, termination, promoter and enhancer sequences; efficient UNA processing 
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus 
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sequence); sequences that enhance protein stability; and when desired, sequences that 
enhance secretion of the encoded product. A great number of expression control 
sequences, including promoters which are native, constitutive, inducible and/or tissue- 
specific, are known in the. art and may be utilized. 
5 Examples of constitutive promoters include, without limitation, 

the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV 
enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) 
[see, e.g., Boshart etal, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate 
reductase promoter, the p-actin promoter, the phosphoglycerol kinase (PGK) promoter, 

10 and the EFla promoter [Invitrogen]. 

Inducible promoters allow regulation of gene expression and can 
be regulated by exogenously supplied compounds, environmental factors such as 
temperature, or the presence of a specific physiological state, e.g., acute phase, a 
particular differentiation state of the cell, or in replicating cells only. Inducible 

15 promoters and inducible systems are available from a variety of commercial sources, 

including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have 
been described and can be readily selected by one of skill in the art. For example, 
inducible promoters include the zinc-inducible sheep metallothionine (MT) promoter and 
the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter. 

20 Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the 
ecdysone insect promoter [No et al Proc. Natl Acad. Set USA, 93:3346-3351 (1996)], 
the tetracycline-repressible system [Gossen et al, Proc, Natl Acad Set USA, 89:5547- 
5551 (1992)], the tetracycline-inducible system [Gossen etal, Science, 268:1766-1769 
(1995), see also Harvey et al, Curr. Opin. Chem. Biol, 2:512-518 (1998)], Other systems 

25 include the FK506 dimer, VP16 or p65 using castradiol, diphenol murislerone, the 

RU486-inducible system [Wang et al, Nat Biotech, 15:239-243 (1 997) and Wang et al, 
Gene Ther^ 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. 
Invest., 100:2865-2872 (1997)]. The effectiveness of some inducible promoters 
increases over time. In such cases one can enhance the effectiveness of such systems by 

30 inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES. 

Alternatively, one can wait at least 3 days before screening for the desired function. One 
can enhance expression of desired proteins by known means to enhance the effectiveness 
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of this system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional 
Regulatory Element (WPRE). 

In another embodiment, the native promoter for the transgene will 
be used. The native promoter may be preferred when it is desired that expression of the 
5 transgene should mimic the native expression. The native promoter may be used when 
expression of the transgene must be regulated temporally or developmentally, or in a 
tissue-specific manner, or in response to specific transcriptional stimuli. In a further 
embodiment, other native expression control elements, such as enhancer elements, 
polyadenylation sites or Kozak consensus sequences may also be used to mimic the 
1 0 native expression . 

Another embodiment of the transgene includes a transgene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal 
muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal 3-actin, myosin light chain 2A, dystrophin, 

15 muscle creatine kinase, as well as synthetic muscle promoters with activities higher than 
• naturally occurring promoters (see Li et al, Nat. Biotech, 17:241-245 (1999)). Examples 
of promoters that are tissue-specific are known for liver (albumin, Miyatake et al, J. 
Virol, 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al, Gene Ther, 
3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnote/a/., Hum. Gene Ther., 7:1503-14 

20 (1 996)), bone osteocalcin (Stein et al, Mol Biol Rep., 24:1 85-96 (1997)); bone 

sialoprotein (Chen et al, J. Bone Miner. Res., 11:654-64 (1 996)), lymphocytes (CD2, 
Hansal et al, J. Immunol, 161:1063-8 (1998); immunoglobulin heavy chain; T cell 
receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et 
al, Cell Mol NeurobioL, 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et 

25 al, Proc. Natl Acad. Set USA, 88:561 1-5 (1991)), and the neuron-specific vgf gene 
(Piccioli et al, Neuron, 15:373-84 (1995)), among others. 

Optionally, vectors carrying transgenes encoding therapeutically useful or 
immunogenic products may also include selectable markers or reporter genes may include 
sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such 

30 selectable reporters or marker genes (preferably located outside the viral genome to be 
packaged into a viral particle) can be used to signal the presence of the plasm ids in 
bacterial cells, such as ampicillin resistance. Other components of the vector may include 



WO 03/046124 



PCT/US02/33645 



an origin of replication. Selection of these and other promoters and vector elements are 
conventional and many such sequences are available [see, e.g., Sambrook et al, and 
references cited therein]. 

These vectors are generated using the techniques and sequences provided 
5 herein, in conjunction with techniques known to those of skill in the art. 

Such techniques inciude conventional cloning techniques of cDNA such as those 
described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping oligonucleotide 
sequences of the adenovirus genomes, polymerase chain reaction, and any suitable 
10 method which provides the desired nucleotide sequence. 

m. Production of the Recombinant Viral Particle 

In one embodiment, the simian adenoviral plasmids (or other vectors) are used to 
produce recombinant adenoviral particles. In one embodiment, the recombinant 

15 adenoviruses are functionally deleted in the El a or Elb genes, and optionally bearing 
other mutations, e.g., temperature-sensitive mutations or deletions in other genes. In 
other embodiments, it is desirable to retain an intact Ela and/or Elb region in the 
recombinant adenoviruses. Such an intact El region may be located in its native location 
in the adenoviral genome or placed in the site of a deletion in the native adenoviral 

20 genome (e.g., in the E3 region). 

In the construction of useful simian adenovirus vectors for delivery of a gene to 
the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can 
be employed in the vectors. For example, all or a portion of the adenovirus delayed early 
gene E3 may be eliminated from the simian adenovirus sequence which forms a part of 

25 the recombinant virus. The function of simian E3 is believed to be irrelevant to the 
function and production of the recombinant virus particle. Simian adenovirus vectors 
may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and 
more desirably because of the redundancy in the function of this region, the entire E4 
region. Still another vector of this invention contains a deletion in the delayed early gene 

30 E2a. Deletions may also be made in any of the iate genes LI through L5 of the simian 
adenovirus genome. Similarly, deletions in the intermediate genes DC and IVa 2 may be 
useful for some purposes. Other deletions may be made in the other structural or non- 
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structural adenovirus genes. The above discussed deletions may be used individually, 
i.e., an adenovirus sequence for use in the present invention may contain deletions in only 
a single region. Alternatively, deletions of entire genes or portions thereof effective to 
destroy their biological activity may be used in any combination. For example, in one 
5 exemplary vector, the adenovirus sequence may have deletions of the El genes and the 
E4 gene, or of the El, E2a and E3 genes, or of the El and E3 genes, or of El, E2a and E4 
genes, with or without deletion of E3, and so on. As discussed above, such deletions may 
be used in combination with other mutations, such as temperature-sensitive mutations, to 
achieve a desired result. 

10 An adenoviral vector lacking any essential adenoviral sequences (e.g., El a, El b, 

E2a, E2b, E4 ORF6, LI, L2, L3, L4 and L5) may be cultured in the presence of the 
missing adenoviral gene products which are required for viral infectivity and propagation 
of an adenoviral particle. These helper functions may be provided by culturing the 
adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or 

15 virus) or a packaging host cell. See, for example, the techniques described for preparation 
of a "minimal" human Ad vector in International Patent Application W096/13597. 
published May 9, 1996, and incorporated herein by reference. 
1, Helper Viruses 

Thus, depending upon the simian adenovirus gene content of the 

20 viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus 
fragment may be necessary to provide sufficient simian adenovirus gene sequences 
necessary to produce an infective recombinant viral particle containing the minigene. 
Useful helper viruses contain selected adenovirus gene sequences not present in the 
adenovirus vector construct and/or not expressed by the packaging cell line in which the 

25 vector is transfected. In one embodiment, the helper virus is replication-defective and 
contains a variety of adenovirus genes in addition to the sequences described above. 
Such a helper virus is desirably used in combination with an El -expressing cell line. 

Helper viruses may also be formed into poly-cation conjugates as 
described in Wu et al, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M. 

30 Wilson, Biochem. J., 299:49 (April 1, 1994). Helper virus may optionally contain a 
second reporter minigene. A number of such reporter genes are known to the art. The 
presence of a reporter gene on the helper virus which is different from the transgene on 
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the adenovirus vector allows both the Ad vector and the helper virus to be independently 
monitored. This second reporter is used to enable separation between the resulting 
recombinant virus and the helper virus upon purification. 

2. Complementation Cell Lines 
5 To generate recombinant simian adenoviruses (Ad) deleted in any 

of the genes described above, the function of the deleted gene region, if essential to the 
replication and infectivity of the virus, must be supplied to the recombinant virus by a 
helper virus or cell line, i.e., a complementation or packaging cell line. In many 
circumstances, a cell line expressing the human El can be used to transcomplement the 

10 chimp Ad vector. This is particularly advantageous because, due to the diversity between 
the chimp Ad sequences of the invention and the human AdEl sequences found in 
currently available packaging cells, the use of the current human El -containing cells 
prevents the generation of replication-competent adenoviruses during the replication and 
production process. However, in certain circumstances, it will be desirable to utilize a 

15 cell line which expresses the El gene products can be utilized for production of an El- 
deleted simian adenovirus. Such cell lines have been described. See, e.g., US Patent 
6,083,716. 

If desired, one may utilize the sequences provided herein to 
generate a packaging cell or cell line that expresses, at a minimum, the adenovirus El 

20 gene from Pan5, Pan6, Pan7, SV1, SV25 or SV39 under the transcriptional control of a 
promoter for expression in a selected parent cell line. Inducible or constitutive promoters 
may be employed for this purpose. Examples of such promoters are described in detail 
elsewhere in this specification. A parent cell is selected for the generation of a novel cell 
line expressing any desired AdPanS, Pan6, Pan7, SV1, SV25 or SV39 gene. Without 

25 limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 
[ATCC Accession No. CCL 1 85], HEK 293, KB [CCL 1 7], Detroit [e.g. Detroit 5 1 0, 
CCL 72] and WI-38 [CCL 75] cells, among others. These cell lines are all available from 
the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia 
201 10-2209. Other suitable parent cell lines may be obtained from other sources. 

30 Such El -expressing cell lines are useful in the generation of 

recombinant simian adenovirus El deleted vectors. Additionally, or alternatively, the 
invention provides cell lines that express one or more simian adenoviral gene products, 
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e.g., El a, Elb, E2a, and/or E4 ORP6, can be constructed using essentially the same 
procedures for use in the generation of recombinant simian viral vectors. Such cell lines 
can be utilized to transcomplement adenovirus vectors deleted in the essential genes that 
encode those products, or to provide helper functions necessary for packaging of a helper- 
5 dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to 
this invention involves techniques such as assembly of selected DNA sequences. This 
assembly may be accomplished utilizing conventional techniques. Such techniques 
include cDNAand genomic cloning, which are well known and are described in 
Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the 
10 adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and 
any other suitable methods which provide the desired nucleotide sequence. 

In still another alternative, the essential adenoviral gene products 
are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a 
suitable host cell can be selected from any biological organism, including prokaryotic 
15 (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and 
mammalian cells. Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549, WEH1, 3T3, 
10T1/2, HEK 293 cells or PERC6 (both of which express functional adenoviral El) 
[Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L cells, HT1 080, 
20 HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals 
including human, monkey, mouse, rat, rabbit, and hamster. The selection of the 
mammalian species providing the cells is not a limitation of this invention; nor is the type 
of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc. 

3. Assembly of Viral Particle and Transfection of a Cell Line 
2 5 Generally, when delivering the vector comprising the minigene by 

transfection, the vector is delivered in an amount from about 5 ug to about 100 ug DNA, 
and preferably about 10 to about 50 ug DNA to about 1 x 10 4 cells to about 1 x 10 13 cells, 
and preferably about 10 s cells. However, the relative amounts of vector DNA to host 
cells may be adjusted, taking into consideration such factors as the selected vector, the 
30 delivery method and the host cells selected. 

The vector may be any vector known in the art or disclosed above, 
including naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc. 

36 
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Introduction into the host cell of the vector may be achieved by any means known in the 
art or as disclosed above, including transfection, and infection. One or more of the 
adenoviral genes may be stably integrated into the genome of the host cell, stably 
expressed as episomes, or expressed transiently. The gene products may ail be expressed 
5 transiently, on an episome or stably integrated, or some of the gene products may be 
expressed stably while others are expressed transiently. Furthermore, the promoters for 
each of the adenoviral genes may be selected independently from a constitutive promoter, 
an inducible promoter or a native adenoviral promoter. The promoters may be regulated 
by a specific physiological state of the organism or cell (i.e., by the differentiation state or 

10 in replicating or quiescent cells) or by exogenously-added factors, for example. 

Introduction of the molecules (as plasmids or viruses) into the host 
cell may also be accomplished using techniques known to the skilled artisan and as 
discussed throughout the specification. In preferred embodiment, standard transfection 
techniques are used, e.g., CaPO* transfection or electroporation. 

15 Assembly of the selected DNA sequences of the adenovirus (as 

well as the transgene and other vector elements into various intermediate plasmids, and 
the use of the plasmids and vectors to produce a recombinant viral particle are all 
achieved using conventional techniques. Such techniques include conventional cloning 
techniques of cDNA such as those described in texts [Sambrook et al, cited above], use of 

20 overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain 
reaction, and any suitable method which provides the desired nucleotide sequence. 
Standard transfection and co-transfection techniques are employed, e.g., CaPC>4 
precipitation techniques. Other conventional methods employed include homologous 
recombination of the viral genomes, plaquing of viruses in agar overlay, methods of 

25 measuring signal generation, and the like. 

For example, following the construction and assembly of the desired 
minigene-containing viral vector, the vector is transfected in vitro in the presence of a 
helper virus into the packaging cell line. Homologous recombination occurs between the 
helper and the vector sequences, which permits the adenovirus-transgene sequences in the 

30 vector to be replicated and packaged into virion capsids, resulting in the recombinant viral 
vector particles. The current method for producing such virus particles is transfection- 
based. However, the invention is not limited to such methods. 
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The resulting recombinant simian adenoviruses are useful in transferring a 
selected transgene to a selected cell. In 67 vivo experiments with the recombinant virus 
grown in the packaging cell lines, the El -deleted recombinant simian adenoviral vectors 
of the invention demonstrate utility in transferring a transgene to a non-simian, preferably 
5 a human, cell. 

IV. Use of the Recombinant Adenovirus Vectors 

The recombinant simian adenovirus vectors of the invention are useful for gene 
transfer to a human or non-simian veterinary patient m vitro, ex vivo, and in vivo, 

10 The recombinant adenovirus vectors described herein can be used as expression 

vectors for the production of the products encoded by the heterologous genes in vitro. 
For example, the recombinant adenoviruses containing a gene inserted into the location of 
an El deletion may be transfected into an El -expressing cell line as described above. 
Alternatively, replication-competent adenoviruses may be used in another selected cell 

15 line. The transfected cells are then cultured in the conventional manner, allowing the 
recombinant adenovirus to express the gene product from the promoter. The gene 
product may then be recovered from the culture medium by known conventional methods 
of protein isolation and recovery from culture. 

A Pan5, Pan6, Pan7, SV1, SV25 or SV39-derived recombinant simian adenoviral 

20 vector of the invention provides an efficient gene transfer vehicle that can deliver a 

selected transgene to a selected host cell in vivo or ex vivo even where the organism has 
neutralizing antibodies to one or more AAV serotypes. In one embodiment, the rAAV 
and the cells are mixed ex vivo; the infected cells are cultured using conventional 
methodologies; and the transduced cells are re-infiised into the patient These 

25 compositions are particularly well suited to gene delivery for therapeutic purposes and for 
immunization, including inducing protective immunity. 

More commonly, the Pan 5, Pan6, Pan7, SV1, SV25, or SV39 recombinant 
adenoviral vectors of the invention will be utilized for delivery of therapeutic or 
immunogenic molecules, as described below. It will be readily understood for both 

30 applications, that the recombinant adenoviral vectors of the invention are particularly well 
suited for use in regimens involving repeat delivery of recombinant adenoviral vectors. 
Such regimens typically involve delivery of a series of viral vectors in which the viral 
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capsids are alternated. The viral capsids may be changed for each subsequent 
administration, or after a pre-selected number of administrations of a particular serotype 
capsid (e.g., one, two, three, four or more). Thus, a regimen may involve delivery of a 
rAd with a first simian capsid, delivery with a rAd with a second simian capsid, and 
5 delivery with a third simian capsid. A variety of other regimens which use the Ad capsids 
of the invention alone, in combination with one another, or in combination with other Ad 
serotypes will be apparent to those of skill in the art. Optionally, such a regimen may 
involve administration of rAd with capsids of other non-human primate adenoviruses, 
human adenoviruses, or artificial serotypes such as are described herein. Each phase of 

10 the regimen may involve administration of a series of injections (or other delivery routes) 
with a single Ad serotype capsid followed by a series with another Ad serotype capsid. 
Alternatively, the recombinant Ad vectors of the invention may be utilized in regimens 
involving other non-adenoviral-mediated delivery systems, including other viral systems, 
non-viral delivery systems, protein, peptides, and other biologically active molecules. 

15 The following sections will focus on exemplary molecules which may be 

delivered via the adenoviral vectors of the invention. 

A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the above-described recombinant vectors are 
administered to humans according to published methods for gene therapy. A simian viral 

20 vector bearing the selected transgene may be administered to a patient, preferably 

suspended in a biologically compatible solution or pharmaceutical^ acceptable delivery 
vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous 
isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions 
known to be pharmaceutical^ acceptable carriers and well known to those of skill in the 

25 art may be employed for this purpose. 

The simian adenoviral vectors are administered in sufficient amounts to 
transduce the target cells and to provide sufficient levels of gene transfer and expression 
to provide a therapeutic benefit without undue adverse or with medically acceptable 
physiological effects, which can be determ ined by those skilled in the medical arts. 

30 Conventional and pharmaceutical^ acceptable routes of administration include, but are 
not limited to, direct delivery to the retina and other intraocular delivery methods, direct 
delivery to the liver, inhalation, intranasal, intravenous, intramuscular, intratracheal, 
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subcutaneous, intradermal, rectal, oral and other parenteral routes of administration. 
Routes of administration may be combined, if desired, or adjusted depending upon the 
transgene or the condition. The route of administration primarily will depend on the 
nature of the condition being treated. 
5 Dosages of the viral vector will depend primarily on factors such as the 

condition being treated, the age, weight and health of the patient, and may thus vary 
among patients. For example, a therapeutically effective adult human or veterinary 
dosage of the viral vector is generally in the range of from about 100 pL to about 100 mL 
of a carrier containing concentrations of from about 1 x 10 6 to about 1 x 10 i5 particles, 
10 about 1 x 10 n to 1 x 10 13 particles, or about 1 x 10 9 to lx 10 12 particles virus. Dosages 
will range depending upon the size of the animal and the route of administration. For 
example, a suitable human or veterinary dosage (for about an 80 kg animal) for 
intramuscular injection is in the range of about 1 x 10 9 to about 5 x 10 12 particles per mL, 
for a single site. Optionally, multiple sites of administration may be delivered. In another 
15 example, a suitable human or veterinary dosage may be in the range of about 1 x 10 u to 
about 1 x 10 15 particles for an oral formulation. One of skill in the art may adjust these 
doses, depending the route of administration, and the therapeutic or vaccinal application 
for which the recombinant vector is employed. The levels of expression of the transgene, 
or for an immunogen, the level of circulating antibody, can be monitored to determine the 
20 frequency of dosage administration. Yet other methods for determining the timing of 
frequency of administration will be readily apparent to one of skill in the art. 

An optional method step involves the co-administration to the patient, 
either concurrently with, or before or after administration of the viral vector, of a suitable 
amount of a short acting immune modulator. The selected immune modulator is defined 
25 herein as an agent capable of inhibiting the formation of neutralizing antibodies directed 
against the recombinant vector of this invention or capable of inhibiting cytolytic T 
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with 
the interactions between the T helper subsets (T HI or T H2 ) and B cells to inhibit 
neutralizing antibody formation. Alternatively, the immune modulator may inhibit the 
30 interaction between T H i cells and CTLs to reduce the occurrence of CTL elimination of 
the vector. A variety of useful immune modulators and dosages for use of same are 
disclosed, for example, in Yang etal, J. Virol, 70(9) (Sept., 1996); International Patent 
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Application No. WO96/12406, published May 2, 1996; and International Patent 
Application No.PCT/US96/03035, all incorporated herein by reference. 
1 . Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include 
5 hormones and growth and differentiation factors including, without limitation, insulin, 
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing 
factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human 
chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, 
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), 

10 connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic 
fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth 
factor (TGF ), platelet-derived growth factor (PDGF), insulin growth factors I and n 
(IGF-I and IGF-H), any one of the transforming growth factor superfamily, including 
TGF, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any 

15 one of the heregluin/neujeguIin/ARIA/neu differentiation factor (NDF) family of growth 
factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), 
neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line 
derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of 
semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, 

20 noggin, sonic hedgehog and tyrosine hydroxylase. 

Other useful transgene products include proteins that regulate the 
immune system including, without limitation, cytokines and lymphokines such as 
thrombopoietin (TPO), interleukins (IL) IL-1 through IL-25 (including, e.g., IL-2, IL-4, 
DL-12 and IL-1 8), monocyte chemoattractant protein, leukemia inhibitory factor, 

25 granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors 
and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by the 
immune system are also useful in the invention. These include, without limitation, 
immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized 
antibodies, single chain antibodies, T cell receptors, chimeric T cell receptors, single 

30 chain T cell receptors, class I and class II MHC molecules, as well as engineered 

immunoglobulins and MHC molecules. Useful gene products also include complement 



41 



I 



WO 03/046124 



PCT/US02/33645 



regulatory proteins such as complement regulatory proteins, membrane cofactor protein 
(MCP), decay accelerating factor (DAF), CR1 , CF2 and CD59. 

Still other useful gene products include any one of the receptors 
for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and 
5 immune system proteins. The invention encompasses receptors for cholesterol regulation, 
including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) 
receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. , 
The invention also encompasses gene products such as members of the steroid hormone 
receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin 

10 D receptors and other nuclear receptors. In addition, useful gene products include 

transcription factors such as junjos, max, mad, serum response factor (SRF), AP-1, AP2, 
myb % MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATFl, ATF2, 
ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, 
interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, 

15 GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix 
proteins. 

Other useful gene products include, carbamoyl synthetase I, 
ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha- 1 antitrypsin, glucose-6- 

20 phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta- 
synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA 
dehydrogenase, propionyl CoA carboxylase, methyl maionyl CoA mutase, glutaryl CoA 
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, 
phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis 

25 transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence. 

Other useful gene products include non-naturally occurring 
polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring 
amino acid sequence containing insertions, deletions or amino acid substitutions. For 
example, single-chain engineered immunoglobulins could be useful in certain 

30 immunocompromised patients. Other types of non-naturally occurring gene sequences 
include antisense molecules and catalytic nucleic acids, such as ribozymes, which could 
be used to reduce overexpression of a target. 
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Reduction and/or modulation of expression of a gene are 
particularly desirable for treatment of hyperproliferative conditions characterized by 
hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those 
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells 
5 as compared to normal cells. Target antigens include polypeptides encoded by oncogenes 
such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and 
EGRF. In addition to oncogene products as target antigens, target polypeptides for 
anti-cancer treatments and protective regimens include variable regions of antibodies 
made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas 

10 which, in some embodiments, are also used as target antigens for autoimmune disease. 
Other tumor-associated polypeptides can be used as target polypeptides such as 
polypeptides which are found at higher levels in tumor cells including the polypeptide 
recognized by monoclonal antibody 17-1 A and folate binding polypeptides. 

Other suitable therapeutic polypeptides and proteins include those 

15 which may be useful fortreating individuals suffering from autoimmune diseases and 

disorders by conferring a broad based protective immune response against targets that are 
associated with autoimmunity including cell receptors and cells which produce self- 
directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis 
(RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent 

20 diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing 

spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's 
granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is 
characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the 
inflammatory cascade associated with autoimmune diseases. 

25 The simian adenoviral vectors of the invention are particularly 

well suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of 
transgenes is desired, e.g., in regimens involving redelivery of the same transgene or in 
combination regimens involving delivery of other transgenes. Such regimens may involve 
administration of a Pan5, Pan6, Pan7, SV1, SV25 or SV39 simian adenoviral vector, 

30 followed by re-administration with a vector from the same serotype adenovirus. 

Particularly desirable regimens involve administration of a Pan5, Pan6, Pan7, SV1, SV25 
or SV39 simian adenoviral vector of the invention, in which the serotype of the viral 
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vector delivered in the first administration differs from the serotype of the viral vector 
utilized in one or more of the subsequent administrations. For example, a therapeutic 
regimen involves administration of a Pan5, Pan6, Pan7, SV1, SV25 or SV39 vector and 
repeat administration with one or more adenoviral vectors of the same or different 
5 serotypes. In another example, a therapeutic regimen involves administration of an 

adenoviral vector followed by repeat administration with a Pan5, Pan6, Pan7, SV1, SV25 
or SV39 vector of the invention which differs from the serotype of the first delivered 
adenoviral vector, and optionally further administration with another vector which is the 
same or, preferably, differs from the serotype of the vector in the prior administration 

10 steps. These regimens are not limited to delivery of adenoviral vectors constructed using 
the Pan5, Pan6, Pan7, SV1, SV25 or SV39 simian serotypes of the invention. Rather, 
these regimens can readily utilize vectors other adenoviral serotypes, including, without 
limitation, other simian adenoviral serotypes (e.g., Pan9 or C68, CI, etc), other non- 
human primate adenoviral serotypes, or human adenoviral serotypes, in combination with 

15 one or more of the Pan5, Pan6, Pan7, SV1 , SV25 or SV39 vectors of the invention. 

Examples of such simian, other non-human primate and human adenoviral serotypes are 
discussed elsewhere in this document. Further, these therapeutic regimens may involve 
either simultaneous or sequential delivery of Pan 5, Pan6, Pan7, SV1, SV25, and/or SV39 
adenoviral vectors of the invention in combination with non-adenoviral vectors, non-viral 

2 0 vectors, and/or a variety of other therapeutically useful compounds or molecules. The 
present invention is not limited to these therapeutic regimens, a variety of which will be 
readily apparent to one of skill in the art. 

B. Ad-Mediated Delivery of Immunogenic Transgenes 

The recombinant simian adenoviruses may also be employed as 

25 immunogenic compositions. As used herein, an immunogenic composition is a 

composition to which a. humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) 
response is mounted to a transgene product delivered by the immunogenic composition 
following delivery to a mammal, and preferably a primate. The present invention 
provides a recombinant simian Ad that can contain in any of its adenovirus sequence 

30 deletions a gene encoding a desired immunogen. The simian adenovirus is likely to be 
better suited for use as a live recombinant virus vaccine in different animal species 
compared to an adenovirus of human origin, but is not limited to such a use. The 
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recombinant adenoviruses can be used as prophylactic or therapeutic vaccines against any 
pathogen for which the antigen(s) crucial for induction of an immune response and able to 
limit the spread of the pathogen has been identified and for which the cDNA is available. 

Such vaccina! (or other immunogenic) compositions are formulated in a 
5 suitable delivery vehicle, as described above. Generally, doses for the immunogenic 
compositions are in the range defined above for therapeutic compositions. The levels of 
immunity of the selected gene can be monitored to determine the need, if any, for 
boosters. Following an assessment of antibody titers in the serum, optional booster 
immunizations may be desired. 

10 Optionally, a vaccinal composition of the invention may be formulated to 

contain other components, including, e.g. adjuvants, stabilizers, pH adjusters, 
preservatives and the like. Such components are well known to those of skill in the 
vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum, 
monophosphoryl lipid A, and any biologically active factor, such as cytokine, an 

1 5 interleukin, a chemokine, a ligands, and optimally combinations thereof Certain of these 
biologically active factors can be expressed in vivo, e.g., via a plasmid or viral vector. 
For example, such an adjuvant can be administered with a priming DNA vaccine 
encoding an antigen to enhance the antigen-specific immune response compared with the 
immune response generated upon priming with a DNA vaccine encoding the antigen 

20 only. 

The recombinant adenoviruses are administered in a "an immunogenic 
amount", that is, an amount of recombinant adenovirus that is effective in a route of 
administration to transfect the desired cells and provide sufficient levels of expression of 
the selected gene to induce an immune response. Where protective immunity is provided, 

25 the recombinant adenoviruses are considered to be vaccine compositions useful in 
preventing infection and/or recurrent disease. 

Alternatively, or in addition, the vectors of the invention may contain a 
transgene encoding a peptide, polypeptide or protein which induces an immune response 
to a selected immunogen. The recombinant adenoviruses of this invention are expected to 

30 be highly efficacious at inducing cytolytic T cells and antibodies to the inserted 
heterologous antigenic protein expressed by the vector. 
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For example, immunogens may be selected from a variety of viral 
families. Example of desirable viral families against which an immune response would 
be desirable include, the picornavirus family, which includes the genera rhinoviruses, 
which are responsible for about 50% of cases of the common cold; the genera 
5 enteroviruses, which include poxviruses, coxsackieviruses, echoviruses, and human 
enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are 
responsible for foot and mouth diseases, primarily in non-human animals. Within the 
picornavirus family of viruses, target antigens include the VP 1, VP2, VP3, VP4, and 
VPG. Another viral family includes the calcivirus family, which encompasses the 
10 Norwalk group of viruses, which are an important causative agent of epidemic 

gastroenteritis. Still another viral family desirable for use in targeting antigens for 
inducing immune responses in humans and non-human animals is the togavirus family, 
which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, 
and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella 
15 virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St. 
Louis encephalitis and tick borne encephalitis viruses. Other target antigens may be 
generated from the Hepatitis C or the coronavirus family, which includes a number of 
non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible 
gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline 
20 infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus 
(dog), and human respiratory coronaviruses, which may cause the common cold and/or 
non-A, B or C hepatitis: Within the coronavirus family, target antigens include the El 
(also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE 
or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), orN 
25 (nucleocapsid). Still other antigens may be targeted against the rhabdovirus family, 

which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general 
lyssavirus (e.g., rabies). Within the rhabdovirus family, suitable antigens may be derived 
from the O protein or the N protein. The family filoviridae, which includes hemorrhagic 
fever viruses such as Marburg and Ebola vims, may be a suitable source of antigens. The 
30 paramyxovirus family includes parainfluenza Virus Type 1 , parainfluenza Virus Type 3, 
bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 
2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, 
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morbillivirus, which includes measles and canine distemper, and pneumovirus, which 
includes respiratory syncytial virus. The influenza virus is classified within the family 
orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the Nl protein). 
The bunyavirus family includes the genera bunyavirus (California encephalitis, La 
5 Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever 
virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The 
arenavirus family provides a source of antigens against LCM and Lassa fever virus. The 
reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis 
in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine 

10 encephalosis, blue tongue). 

The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and 
HTLV1I, lentivirinal (which includes human immunodeficiency virus (HIV), simian 
immunodeficiency virus (S1V), feline immunodeficiency virus (FIV), equine infectious 

15 anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have 
been described and can readily be selected. Examples of suitable HIV and SIV antigens 
include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, 
as well as various fragments thereof. For example, suitable fragments of the Env protein 
may include any of its subunits such as the gpl20, gpl60, gp41, or smaller fragments 

20 thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the tat 

protein may be selected. [See, US Patent 5,891,994 and US Patent 6,193,981.] See, also, 
the HIV and SIV proteins described in D.H. Barouch et al, J. Virol., 75(5):2462-2467 
(March 2001), and BLR. Amara, et al, Science, 292:69-74 (6 April 2001). In another 
example, the HIV and/or SIV immunogenic proteins or peptides may be used to form 

25 fusion proteins or other immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef 

fusion proteins and immunization regimens described in WO 01/54719, published August 
2, 2001, and WO 99/16884, published April 8, 1999. The invention is not limited to the 
HIV and/or SFV immunogenic proteins or peptides described herein. In addition, a 
variety of modifications to these proteins have been described or could readily be made 

30 by one of skill in the art. See, e.g., the modified gag protein that is described in US Patent 
5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or 
in combination. Such combinations may include expression from a single vector or from 
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multiple vectors. Optionally, another combination may involve delivery of one or more 
expressed immunogens with delivery of one or more of the immunogens in protein form. 
Such combinations are discussed in more detail below. 

The papovavirus family includes the sub-family polyomaviruses (BKU 
5 and JCU viruses) and the sub-family papillomavirus (associated with cancers or 

malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, 
ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family 
feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and 
porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae, 

10 which encompasses the genera simplexvirus (HSVI, HSVU), varied lovirus 

(pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the 
genera cytomegalovirus (HCMV, muromegalo virus) and the sub-family 
gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitts 
lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The 

15 poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera 
orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, 
capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. 
The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which 
may be suitable source of antigens is the Hepatitis delta virus. Still other viral sources 

20 may include avian infectious bursal disease virus and porcine respiratory and 

reproductive syndrome virus. The alphavirus family includes equine arteritis virus and 
various Encephalitis viruses. 

The present invention may also encompass immunogens which 
are useful to immunize a human or non-human animal against other pathogens including 

25 bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human 
and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial 
pathogens include pathogenic gram-positive cocci include pneumococci; staphylococci; 
and streptococci. Pathogenic gram-negative cocci include meningococcus; gonococcus. 
Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas, 

30 acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; 
H. ducreyi (which causes chancroid); brucella; Franisella tularensis (which causes 
tularemia); yersinia (pasteurella); streptobacillus moniliformis and spirillum; Gram- 
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positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; 
Corynebacterium diphtheria (diphtheria); cholera; B. anthracis (anthrax); donovanosis 
(granuloma inguinale); and bartonellosis. Diseases caused by pathogenic anaerobic 
bacteria include tetanus; botulism; other Clostridia; tuberculosis; leprosy; and other 
5 mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, 
pinta and endemic syphilis; and leptospirosis. Other infections caused by higher 
pathogen bacteria and pathogenic fungi include actinomycosis; nocardiosis; 
cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; candidiasis, 
aspergillosis, and mucormycosis; sporotrichosis; paracoccidiodomycosis, petriellidiosis, 

10 torulopsosis, mycetoma and chromomycosis; and derrnatophytosis. Rickettsial infections 
include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox. 
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. 
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections 

15 produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; 
toxoplasmosis; Pneumocystis carinii\ Dickons; Toxoplasma gondii] babesiosis; 
giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and 
cestode (tapeworm) infections. 

Many of these organisms and/or toxins produced thereby have 

20 been identified by the Centers for Disease Control [(CDC), Department of Heath and 
Human Services, USA], as agents which have potential for use in biological attacks. For 
example, some of these biological agents, include, Bacillus cmthracis (anthrax), 
Clostridium botulinum and its toxin (botulism), Yersinia pestis (plague), variola major 
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers [filoviruses 

25 (e.g., Ebola, Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of which are 

currently classified as Category A agents; Coxiella burnetti (Q fever); Brucella species 
(brucellosis), Burkholderia mallei (glanders), BurkJiolderia pseudomallei (meloidosis), 
Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and its toxin 
(epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), Chlamydia 

30 psittaci (psittacosis), water safety threats (e.g., Vibrio cholerae, Crytosporidium parvum\ 
Typhus fever (Rickettsia powazetif), and viral encephalitis (alphaviruses, e.g., 
Venezuelan equine encephalitis; eastern equine encephalitis; western equine 
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encephalitis); all of which are currently classified as Category B agents; and Nipan virus 
and hantaviruses, which are currently classified as Category C agents. In addition, other 
organisms, which are so classified or differently classified, may be identified and/or used 
for such a purpose in the future. It will be readily understood that the viral vectors and 
5 other constructs described herein are useful to deliver antigens from these organisms, 
viruses, their toxins or other by-products, which will prevent and/or treat infection or 
other adverse reactions with these biological agents. 

Administration of the vectors of the invention to deliver 
immunogens against the variable region of the T cells elicit an immune response 

10 including CTLs to eliminate those T cells. In RA, several specific variable regions of 
TCRs which are involved in the disease have been characterized. These ICRs include 
V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at 
least one of these polypeptides will elicit an immune response that will target T cells 
involved in RA. In MS, several specific variable regions of TCRs which are involved in 

15 the disease have been characterized. These TCRs include V-7 and Va-10. Thus, delivery 
of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an 
immune response that will target T cells involved in MS. In scleroderma, several 
specific variable regions of TCRs which are involved in the disease have been 
characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, 

20 Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a recombinant simian adenovirus that 
encodes at least one of these polypeptides will elicit an immune response that will target 
T cells involved in scleroderma. 

C. Ad-Mediated Delivery Methods 

The therapeutic levels, or levels of immunity, of the selected gene can be 

25 monitored to determine the need, if any, for boosters. Following an assessment of CD8+ 
T cell response, or optionally, antibody titers, in the serum, optional booster 
immunizations may be desired. Optionally, the recombinant simian adenoviral vectors of 
the invention may be delivered in a single administration or in various combination 
regimens, e.g., in combination with a regimen or course of treatment involving other 

30 active ingredients or in a prime-boost regimen. A variety of such regimens have been 
described in the art and may be readily selected. 
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For example, prime-boost regimens may involve the administration of a 
DNA (e.g., plasmid) based vector to prime the immune system to second, booster, 
administration with a traditional antigen, such as a protein or a recombinant virus carrying 
the sequences encoding such an antigen. See, e.g., WO 00/1 1 140, published March 2, 
5 2000, incorporated by reference. Alternatively, an immunization regimen may involve 
the administration of a recombinant simian adenoviral vector of the invention to boost the 
immune response to a vector (either viral or DNA-based) carrying an antigen, or a 
protein. In still another alternative, an immunization regimen involves administration of a 
protein followed by booster with a vector encoding the antigen. 
10 In one embodiment, the invention provides a method of priming and 

boosting an immune response to a selected antigen by delivering a plasmid DNA vector 
carrying said antigen, followed by boosting with a recombinant simian adenoviral vector 
of the invention. In one embodiment, the prime-boost regimen involves the expression 
of multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 
15 292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of 
protein subunits useful for generating an immune response against HIV and SIV. For 
example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and 
Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env is delivered 
in a recombinant adenovirus construct of the invention. Still other regimens are described 
20 in WO 99/16884 and WO 01/54719. 

However, the prime-boost regimens are not limited to immunization for 
HIV or to delivery of these antigens. For example, priming may involve delivering with a 
first chimp vector of the invention followed by boosting with a second chimp vector, or 
with a composition containing the antigen itself in protein form. In one example, the 
25 prime-boost regimen can provide a protective immune response to the virus, bacteria or 
other organism from which the antigen is derived. In another desired embodiment, the 
prime-boost regimen provides a therapeutic effect that can be measured using convention 
assays for detection of the presence of the condition for which therapy is being 
administered. 

30 The priming composition may be administered at various sites in the body 

in a dose dependent manner, which depends on the antigen to which the desired immune 
response is being targeted. The invention is not limited to the amount or situs of 
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injection(s) or to the pharmaceutical carrier. Rather, the regimen may involve a priming 
and/or boosting step, each of which may include a single dose or dosage that is 
administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals 
may receive one or two doses containing between about 10 ^g to about 50 \xg of plasmid 
5 in carrier. A desirable amount of a DNA composition ranges between about I fig to about 
10,000 |xg of the DNA vector. Dosages may vary from about 1 \xg to 1000 jig DNA per 
kg of subject body weight. The amount or site of delivery is desirably selected based 
upon the identity and condition of the mammal. 

The dosage unit of the vector suitable for delivery of the antigen to the 

10 mammal is described herein. The vector is prepared for administration by being 

suspended or dissolved in a pharmaceutical ly or physiologically acceptable carrier such as 
isotonic saline; isotonic salts solution or other formulations that will be apparent to those 
skilled in such administration. The appropriate carrier will be evident to those skilled in 
the art and will depend in large part upon the route of administration. The compositions of 

15 the invention may be administered to a mammal according to the routes described above, 
in a sustained release formulation using a biodegradable biocompatible polymer, or by 
on-site delivery using micelles, gels and liposomes. Optionally, the priming step of this 
invention also includes administering with the priming composition, a suitable amount of 
an adjuvant, such as are defined herein. 

2 o Preferably, a boosting composition is administered about 2 to about 27 

weeks after administering the priming composition to the mammalian subject. The 
administration of the boosting composition is accomplished using an effective amount of 
a boosting composition containing or capable of delivering the same antigen as 
administered by the priming DNA vaccine. The boosting composition may be composed 

25 of a recombinant viral vector derived from the same viral source (e.g., adenoviral 
sequences of the invention) or from another source. Alternatively, the "boosting 
composition" can be a composition containing the same antigen as encoded in the 
priming DNA vaccine, but in the form of a protein or peptide, which composition induces 
an immune response in the host. In another embodiment, the boosting composition 

30 contains a DNA sequence encoding the antigen under the control of a regulatory sequence 
directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or 
viral vectors. The primary requirements of the boosting composition are that the antigen 



52 



WO 03/046124 



PCT/US02/33645 



of the composition is the same antigen, or a cross-reactive antigen, as that encoded by the 
priming composition. 

In another embodiment, the simian adenoviral vectors of the invention are 
also well suited for use in a variety of other immunization and therapeutic regimens. 
5 Such regimens may involve delivery of simian adenoviral vectors of the invention 

simultaneously or sequentially with Ad vectors of different serotype capsids, regimens in 
which adenoviral vectors of the invention are delivered simultaneously or sequentially 
with non-Ad vectors, regimens in which the adenoviral vectors of the invention are 
delivered simultaneously or sequentially with proteins, peptides, and/or other biologically 
10 useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one 
of skill in the art. 

The following examples illustrate the cloning of the simian adenoviruses 
and the construction of exemplary recombinant adenovirus vectors of the present 
invention. These examples are illustrative only, and do not limit the scope of the present 
15 invention. 

Example 1 - Viral Propagation 

The Pan5 [ATCC Accession No. VR-591], Pan6 [ATCC Accession No. 
VR-592], and Pan7 [ATCC Accession No. VR-593] viruses, originally isolated from 

20 lymph nodes from chimpanzees, were propagated in 293 cells [ATCC CRL1573]. 
Typically, these cells are cultured in Dulbecco's Modified Eagles Medium (DMEM; 
Sigma, St Louis, MO.) supplemented with 10% fetal calf serum (FCS) [Sigma or 
Hyclone, Logan, UT] and 1 % Penicillin-Streptomycin (Sigma). Infection of 293 cells is 
carried out in DMEM supplemented with 2% FCS for the first 24 hours, after which FCS 

25 is added to bring the final concentration to 1 0%. Infected cells are harvested when 100% 
of the cells exhibit virus-induced cytopathic effect (CPE), and are then collected, and 
concentrated by centrifugation. Cell pellets are resuspended in 10 mM Tris (pH 8.0), and 
lysed by 3 cycles of freezing and thawing. Virus preparations are obtained following two 
ultra centrifugation steps on cesium chloride density gradients and stocks of virus are 

30 diluted to 1 to 5 x 10 12 particles/ml in 10 mM Tris/100 mM NaCI/50% glycerol and stored 
at-70°C 
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The ability of 293 cells to propagate these adenoviruses exceeded expectations 
which were based on knowledge of other non-human adenovirus serotypes. 



Virus Yield (virus particles produced in 8x1 0 8 cells') 
Pan5 8.8 x 10 13 

5 Pan6 1.6 x 10 14 

Pan7 8,8 xlO 13 

Example 2 - Characterization of Viral Genomic DNA 
10 Genomic DNA was isolated from the purified virus preparations of 



Example 1 and digested with HindlH or BarnHI restriction enzymes following the 
manufacturers* recommendations. The results (not shown) revealed that that the Pan5, 
Pan6, Pan7 genomes of the invention and the published Pan 9 (C68) genome show 
different restriction patterns, and thus, are distinct from each other. 

15 The nucleotide sequences of Pan5, Pan6 and Pan7 were determined. The 

nucleotide sequence of the top strand of Pan5 DNA is reported in SEQ ID NO: 1. The 
nucleotide sequence of the top strand of Pan6 DNA is reported in SEQ ID NO: 5. The 
nucleotide sequence of the top strand of Pan7 DNA is reported in SEQ ID NO: 9. 

Regulatory and coding regions in the viral DNA sequences were 

20 identified by homology to known adenoviral sequences using the "Clustal W" program 
described above at conventional settings. See the tables above providing the adenoviral 
sequences. Open reading frames were translated and the predicted amino acid sequences 
examined for homology to previously described adenoviral protein sequences, Ad4, Ad5 f 
Ad7, Adl2, and Ad4(h 

25 Analysis of the sequence revealed a genome organization that is similar to 

that present in human adenoviruses, with the greatest similarity to human Ad4. However, 
substantial differences in the hexon hypervariable regions were noted between the 
chimpanzee adenoviruses and other known adenoviruses, including AdHu4. These 
differences fit well with the serological cross-reactivity data that has been obtained (see 

30 below). 

An alignment of a portion of the hexon sequences is shown in Fig. 1. The 
portion shown is from the region of the hexon that corresponds to the outwardly disposed 
extended loops DEI and FG1 where the most variability between serotypes is observed. 
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An intervening portion that contributes to the base of the hexon (corresponding to 
residues 308-368 of the published AdC68 sequence; US Patent 6,083,716), and is highly 
conserved between serotypes, is also present. The following table summarizes the pair- 
wise comparisons of the amino acids in the hexon proteins. 



Comparison 


Hexon amino-acid 
Similarity (%) 


#1 


#2 




AdC5 


AdC7 


99.0 


AdC5 


AdC68 


98.3 


AdC5 


AdC6 


88.0 


AdC5 


AdCl 


84.9 


AdC6 


AdC7 


87.7 


AdC6 


AdC68 


87.3 


AdC6 


AdCl 


84.9 


AdC7 


AdC68 


97.5 


AdC7 


AdCl 


84.8 


AdC68 


AdCl 


84.9 



Analysis of the fiber knob domain (which is responsible for receptor 
binding) of the chimpanzee adenoviruses shows an overall similarity in structure (Fig, 2). 
10 The degree of sequence similarity between the El proteins of huAdS and 

C68 (see Tables below) is similar to that between huAd5 and Pan-5, Pan-6, and Pan-7. 
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Comparison 


Rla (\^fs\ aminn-acid 

Lv L d ^UOI C1I11JUJVJ flwlU 

identity 


ft i 


#2 




AdHuS 

rvu J lUJ 


AdC5 


36.6 


Adha5 


AdC6 


28.5 


rvUnuJ 


AdP7 


34.9 


rvunuj 




35.6 


AdHuS 

r*VJ JL I LI J 


AdCl 


35.6 


AdCS 


AdC6 

AUvv 


68.3 


Adf^ 


AdC7 


96.9 


Adrs 


AdC68 

AUVyvO 


80.4 


AdC5 


AdCl 


51.3 i 


AdC6 


AdC7 


69.3 


AdC6 


AdC68 


59.4 


AdC6 


AdCl 


37.7 


AdC7 


AdC68 


81.5 


AdC7 


AdCl 


51.0 


AdC68 


AdCl 


54.9 





Sequence Identity with human 
Ad5 




Elb Small T 
Protein 


Elb Large T 
Protein 


C68 


47.3% 


55.8% 


Pan-5 


43.2% 


54.5% 


Pan-6 


45.3% 


54.5% 


Pan-7 


46.4% 


53.8% 
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Replication-defective versions of AdC5, AdC6 and AdC7 were created by 
molecular cloning methods described in the following examples in which minigene 
cassettes were inserted into the place of the Ela and Elb genes. The molecular clones of 
the recombinant viruses were rescued and grown up in 293 cells for large-scale 
5 purification using the published CsCl sedimentation method [K. Fisher et ai t J. Virol., 
70:520 (1996)]. Vector yields were based on 50 plate (150 mm) preps in which 
approximately 1 x 10 9 293 cells were infected with the corresponding viruses. Yields 
were determined by measuring viral particle concentrations spectrophotometrically. After 
having constructed El-deleted vectors, it was determined that HEK 293 cells (which 

10 express human adenovirus serotype 5 El functions) trans-complement the El deletions of 
the novel viral vectors and allow for the production of high titer stocks. Examples of 
virus yields for a few of these recombinant viruses are shown in the table below. 

The transgenes for these vectors, p-galactosidase (LacZ), green 
fluorescent protein (GFP), alpha-l-anti-trypsin (Al AT), ebola glycoprotein (ebo), a 

15 soluble ebola glycoprotein variant lacking the transmembrane and cytoplasmic domains 
(sEbo), and three deletion mutants of the ebola glycoprotein (EboA2, EboA3, and 
EboA4), were expressed by the cytomegalovirus promoter (CMV). In the following table, 
ND indicates that the study has not yet been done. 



Transgene 


Viral backbone/Vector yield (Viral particles x 10' 3 ) 




AdHu5 




AdC7 




AdC68 




AdC6 


















CMVLacZ 


1.5 




1.4 




3.3 




6.1 


CMVGFP 


2.5 




3.6 




8 




10 


CMVA1AT 


3.7 




6 




10 




ND 


CMVEbo 


1.1 




4.3 




ND 




ND 


CMVsEbo 


4,9 




5.4 




ND 




ND 


CMVEboA2 


1 




9.3 




ND 




ND 


CMVEboA3 


0.8 




9.5 




ND 




ND 


CMVEboA4 


1.4 




6.2 




ND 




ND 
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The ability of human adenovirus El to trans-complement the El -deleted 
chimpanzee viruses of the invention is highly advantageous, as it permits the production 
of El -deleted chimpanzee adenoviral vectors of the invention, while reducing or 
eliminating the risk of homologous recombination due to the differences in sequences 
5 between human Ad and the chimpanzee adenoviruses described herein. 

Example 3 - Serological Studies of Pan 5, 6, and 7 Viruses 

Because of the differences in the hexon hypervariable region, it was anticipated 
that the C5, C6, and C7 viruses would be serologically distinct from human adenoviruses, 

10 including AdHu4. 

1 . Cross-Reactivity of Wild-type Viruses 

For screening of wild-type viruses in order to make a determination of 
antibody cross-reactivity, the replication competent viruses were used and inhibition of 
cytopathic effects (CPE) was measured. Briefly, preparations of adenoviruses (AdhuS, 

15 Pan-5, Pan-6, Pan-7 and AdC68) stored at 5xl0 12 particles/ml were diluted 1/600 for the 
assays. This concentration of virus was selected since it results in 1 00% CPE within 48 
hours in the absence of neutralization. Prior to adding the virus to 293 cells (4x1 0 4 
cells/well in a 96 well dish), 1 :20 dilutions of sera were added. The assay is read as the 
presence or absence of CPE; full neutralization would read as no cytopathic effect. The 

20 results are summarized in the Table below. The feet that 9/36 human sera neutralized 

Adhu5 induced CPE is consistent with previous estimates of neutralizing antibodies in the 
human population. The numbers indicate the total individuals who showed neutralization 
(numerator) versus the total number screened (denominator). ND = not determined. 

Neutralization by 1/20 diln of serum 





Human 


Rhesus 


Chimpanzee 




(N=36) 


(N=52) 


(N=20) 


AdhuS • 


9/36 


ND 


ND 


AdC68 


1/36 


0/52 


12/20 


Pan 5 


0/36' 


0/52 


10/20 


Pan 6 


0/36 


0/52 


9/20 


Pan 7 


0/36 


0/52 


12/20 
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Of all human sera screened, 35/36 were negative for neutralization to AdC68 
while 36/36 were negative for neutralization to Pan-5, Pan-6 and Pan-7. Of 52 rhesus 
monkeys screened, none showed neutralization to any chimpanzee adenovirus; rhesus 
monkey is the preferred pre-clinical model for evaluating HIV vaccines. Between 9 to 12 
5 out of 20 chimpanzees had substantial neutralization to one or another of the chimpanzee 
adenoviruses consistent with the fact these are indeed endemic chimpanzee-specific 
pathogens. Interestingly, there are chimpanzees with neutralizing antibodies only to Pan- 
5, Pan-6 or AdC68 supporting the hypothesis that several of these chimpanzee adenoviral 
vectors will not cross neutralize each other and are distinct serotypes. 

10 The same assay was carried out for 20 chimpanzee serum samples. Fifty percent 

(50%) of the samples reacted serologically, in different degrees to Pan5; 40% to Pan6; 
55% to Pan7 and 60% to C68. Among the positive serum samples, one of them had 
strong neutralizing activity to all four chimp viruses. 

2. Cross-neulralization with Recombinant Viruses 

15 High-titer polyclonal antibodies were obtained to each of the simian 

adenoviruses in order to more precisely gauge the degree of cross-neutralization among 
the different serotypes. This was done by intramuscular immunization of rabbits using a 
recombinant virus containing GFP based on previously the described C68 chimpanzee 
adenovirus as an adjuvant. The serum was then used to assay for neutralizing activity 

20 against each of the three chimpanzee adenoviruses of the invention, AdC5, AdC6 and 
AdC7. A rabbit was injected with 5x1 0 12 viral particle per kg of C68CMVGFP vector 
intramuscularly and boosted 5 weeks later using the same dose. A bleed collected at the 9 
week time point revealed extremely potent neutralizing activity against C68 as well as 
Pan-5 and Pan-7 but not against Pan-6 (see Table below), indicating that the 

25 administration of a C68 (or Pan-5 and Pan-7) based vaccine could be effectively followed 
by a boost using a vector based on Pan-6. However, it has been found that this level of 
inter-relatedness does not necessarily prevent with re-administration in a setting where 
antiviral antibody titers were not as high as was achieved in this rabbit. In the following 
table, + indicates 33% CPE; ++ indicates 66% CPE; +++ indicates 100% CPE. 
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Infection on 293 cells with virus: j 




Pan5 


Pan6 


Pan7 


Pan9(C68) 


C68 

GFP \ 


Serum 
Dilution 


- 


+++ 


- 


- 




1/20 


- 


+++ 




- 




1/40 


- 


+++ 


- 


- 


- 


1/80 


- 


i i "I - 


- 


- 


— 


1/160 




+++ 




- 


- 


1/320 




+++ 


- 


- 


- 


1/640 


- 


+++ 








1/1,280 


- 


+++ 


- 




- 


1/2,560 




+++ 


- 


- 


- 


1/5,120 


+ 


+++ 


- 






1/10,240 


+ 


~\ h"t" 


++ 


- 


- 


1/20,480 


++ 


+++ 


+++ 






1/40,960 


++ 


+++ 


+++ 


+ 


+ 


1/81,920 


-H-f 


+++ 


+++ 


4+ 


++ 


1/163,840 


+++ 


+++ 


+-H- 


+++ 


,i i i 

TTT 


1/327,680 


+++ 


+++ 


+-H- 


+++ 




1/665,360 




+++ 


+++ 


+++ 


+++ 


1/1,310,720 


+++ 


+++ 


+++ 


+++ 


+++ 


1/2,621,440 
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3 . Quantitative Assay for Detection of Neutralizing Antibody 

The result was validated by a more quantitative-based assay for detecting 
neutralizing antibody, which is based on transduction of a GFP vector. Briefly, groups of 
C57BL/6 mice were immunized intramuscularly or intravenously with 5.0 x 10 10 
5 particles/ml Pan5, Pan6, Pan7 or C68. Sera from day 28 bleeds were tested for cross- 
neutralizing activity against C68CMVEGFP at dilutions of 1/20 and 1/80. In summary, 
when a pharmaceutical preparation of human immunoglobulin was tested for serological 
reactions to Pan 5, 6, and 7, and C68, some low levels of neutralizing activities against 
Pan 7 and C68 were detected. No neutralizing activity against Pan5 or Pan6 was 
10 detected. Serum samples from 36 human subjects were run for the same assay. Serum 
samples were tested at a 1/20 dilution. The results indicated that only one individual has 
clear neutralizing activity to C68. No neutralizing activity to Pan5, Pan6 or Pan7 was 
detected. 

4. In Vitro Cross-Neutralization 

15 Cross-neutralization of the simian adenoviruses by high-titer rabbit 

polyclonal antibodies raised against each of the adenoviruses Pan-5, Pan-6, Pan-7, and 
C68 was tested. 

Rabbits were immunized with intra-muscular injections of 10 13 particles 
of each of the chimpanzee adenoviruses and boosted 40 days later with the same dose 

20 with incomplete Freund's adjuvant. Sera were analyzed or the presence of neutralizing 
antibodies by incubating serial two-fold dilutions with 10 9 genome copies of each of the 
appropriate chimpanzee adenovirus vector expressing GFP and testing for the attenuation 
of GFP expression when applied to 293 cells. The serum dilution which produced a 50% 
reduction of GFP expression was scored as the neutralizing antibody titer against that 

25 particular virus. 

The results are shown in the Table. The data are consistent with the 
expectation from sequence analysis of the hexon amino-acid sequences, which indicated 
that Ad Pan-6 was likely to be the most serologically distinct compared to the other 
chimpanzee adenoviruses. 
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Infection of 293 cells with 10 9 genome copies of 


Serum from 
rabbit 
immunized 
with; 


Ad Pan-5 


Ad Pan-6 


Ad Pan-7 


AdC68 


Ad Pan-5 


1/5120 


<l/20 


1/2560 


1/2560 


Ad Pan-6 


No 

neutralization 


1/20,480 


<l/20 


<l/20 


Ad Pan-7 


1/2560 


1/160 


1/163,840 


1/2560 


AdC68 


No • 

neutralization 


<l/20 


<l/20 


1/5120 



In order to determine whether antibodies cross-reacting with the simian 
adenoviruses were likely to be of low prevalence in humans, simian adenoviruses SVl, 
SV39, and SV25 were tested for their ability to withstand neutralization when incubated 

5 with commercially available pooled human immunoglobulins (Ig). The same assay was 
also performed with AdhuS and the chimpanzee adenoviruses Pan-5, Pan-6, Pan-7, and 
C68. In a further study, sera from mice has been immunized with one of the chimpanzee 
adenoviruses C5, C6, C7, and C68 and their ability to cross-neutralize the simian 
adenoviruses SV-15, SV-23, SA-17, and Baboon Adenovirus has been tested. No cross- 

10 reactivity was observed in any case. 

Example 4 - Generation of Recombinant El -Deleted Pan5 Vector 

A modified pX plasmid was prepared by destroying the Fspl site in the bla 
gene region of pX (Clontech) by site-directed mutagenesis. The resulting modified 
15 plasmid, termed pX', is a circular plasmid of 3000 bp which contains an fl ori and an 
ampicillin resistance gene (AmpR-cds). 

A. Production of Pan-5 Adenovirus Plasmid 

A polylinker for sequential cloning of the Pan5 DNA fragments 
into pX' is created. The polylinker is substituted for the existing pX' polylinker 
20 following digestion with Mlul and EcoRI. The blunt-Fsel fragment of the Pan 5 is 

inserted into the Smal and Fsel sites of the polylinker. This fragment contains the 5* end 
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of the adenoviral genome (bp 1 to 3606, SEQ ID NO:l). The SnaBI-FspI fragment of 
Pan 5 (bp 455 to 3484, SEQ ID NO:l) is replaced with a short sequence flanked by I-Ceu 
and Pl-Sce sites from pShuttle (Clontech), to eliminate the El region of the adenoviral 
genome. The EcoRI-blunt fragment of Pan5 (bp 28658 to 36462, SEQ ID NO:l) is 
5 inserted into the EcoRI and EcoRV sites of the polylinker (to provide the 3' end of the 
adenoviral genome); the Fsel-Mlul fragment (bp 3606 to 15 135, SEQ ID NO:l) is 
inserted into the polylinker; and the MluI-EcoRI fragment is inserted into the polylinker 
(bp 15135 to 28658, SEQ ID NO:l). Optionally, a desired transgene is inserted into /- 
Ceul and Pl-Scel sites of the newly created pX'PanSAEl vector. 

10 B. Alternative Method of Generating pXTanSAEl . 

The initial plasmid pX is derived from pAdX adenovirus 
plasmid available from Clontech, as described above. Thereafter, a Pacl-Xhol region 
of pX' was deleted and the blunt-ended Pan5 polylinker was inserted into the Fsplsitc 
to generate pX'PLNK (2994 bp). The 5'end-flw?/ region of Pan 5 (bp 1-3607, SEQ 

15 ID NO: 1 ) was inserted into Smal and Fsel sites of pX'LNK to generate the pX'PanS- 
5' plasmid (6591 bp). The SnaBhNdel region of pX'PanS-5' was excised and 
replaced with the Ceu/Sce cassette, which had been PCR amplified from pRCS to 
create pX'PanS-S'AEl (4374 bp). Briefly, a sequence containing I-Ceul and Pl-Scel 
rare cutter sites was PCR amplified from pRCS (3 1 13bp). The 3' PCR primer was 

2 0 introduced an Ndel site into the PCR product. 

To extend the Pan5 DNA in pX'Pan5-5'AEl (4374 bp), the 
Fsel-Mul region of Pan 5 (bp 3607-15135, SEQ ID NO:l) is added, to create 
pX'Pan5-5'Mlu (15900 bp). The remaining MluI-3' end of the Pan5 sequence (bp 
15135-36462, SEQ ID NO:l) is added to the vector between the Mlul and EcoRV 

25 sites of the vector polylinker to form pX'Pan5AEl which contains the full-length 
Pan5 sequence containing a deletion in the El region. 

C. Generation of Recombinant Viruses 

To generate the recombinant adenoviruses from pX'PanSAEl, 
the plasmid is co-transfected with a helper expressing El, or from an El -expressing 

30 packaging cell line, such as 293 cell line or a cell line prepared as described herein. 
Hie expression of El in the packaging cell permits the replication and packaging of 
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the Pan5AEl into a virion capsid. In another embodiment, the packaging cell 
transfected with pX'PanS AE1 is transfected with an adenovirus vector as described 
above bearing the transgene of interest. Homologous recombination occurs between 
the helper and the plasmid, which permits the adenovirus-transgene sequences in the 
5 vector to be replicated and packaged into virion capsids, resulting in the recombinant 
adenovirus. 

Transfection is followed by an agar overlay for 2 weeks, after 
which the viruses are plaqued, expanded and screened for expression of the transgene. 
Several additional rounds of plaque purification are followed by another expansion of 
10 the cultures. Finally the cells are harvested, a virus extract prepared and the 

recombinant chimpanzee adenovirus containing the desired transgene is purified by 
buoyant density ultracentrifiigation in a CsCl gradient or by alternative means known 
to those of skill in the art. 

1 5 Example 5 - Generation of Recombinant El -Deleted Pan6 Vector 

A. Strategy for Construction of Pan-6 Adenoviral Plasmid 
J. Cloning of terminal fragments 

Pan 6 virus is deproteinated by pronase and proteanase K 
treatment and phenol extraction. Synthetic 1 2 bp Pme I linkers are ligated onto the viral 
20 DNA as described by Berkner and Sharp, Nucleic Acids Research, 11: 6003 (1983). The 
viral DNA is then digested with Xba I to isolate a 5* terminal fragment (6043 bp). The 
Ad6 Xbal 5' fragment is then ligated into pX link at Sma I and Xba I sites to form pX- 
AdPan6~0-16.5. The viral DNA with Pme I linkers is also digested with Pac I to isolate 
the 6475 bp 3' terminal fragment and cloned into pX link at Pac I and Sma I sites, 
2 5 resulting in pX AdPan6-82- 1 00. 

2. Deletion of El from the 5 9 clone 

To delete El (m.u. 1.2-9), the BsiWi-Xba I fragment in pX- 
AdPan6-0-16.5 is replaced with a PCR fragment spanning m.u.9-16.7 fragment treated 
with BsiWi and Xba I, leading to pX-Ad-Pan6 m.u.0-1, 9-16.5 . 
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3. Fusion of 5 ' and 3 ' clones and to create an anchor site to 
accept the middle Hind III fragment 

First, the 5' clone, pX-Ad-Pan6 m.u,0-l, 9-16.5, is further 
5 expanded by inserting the 2 nd Xba I fragment (4350 bp, m.u.16.5 - 28) from Pan 6 

genome into the Xba I site in the pX-Ad-Pan6 m.u,0-l, 9-16.5* This construct is named 
pXAd-Pan6-mu 0-1, 9-28. 

Second, the 3 J clone is also expanded by inserting the 15026 bp 
Mlu I/Pac I fragment covering m.u.41-82 from Pan 6 genome into the Mlu I/Pac I sites of 
10 pXAdPan6-82-100, generating pXAdPan6-m.u.41-100. 

Then, a 8167 bp Hind III/Eco 47III Pan 6 fragment is isolated 
from pXAd-Pan6-mu 0-1, 9-28 and subcloned into pXAdPan6-m.u.4 1-100 at Hind in and 
Xba I blunt sites. This 5' and 3' fusion clone is called pXAdPan6muO-l, 9-19.5, 64-100. 

4. Drop of the middle fragment of the genome into the fusion 

15 clone 

A 16335 bp Hind III fragment (m.u.19.5-64) from Pan 6 is 
inserted into Hind III site of pXAdPan6mu0-l , 9-19.5, 64-100 to form pXAdPan6-0-l, 9- 
100. 

5. Introduction of a PKGFP selective marker into the final 
20 construct for direct cloning the gene of interest and green/white selection of 

recombinant transformants. 

A minigene cassette that expresses GFP under a lac promoter and 
is flanked with recognition sites of rare intron encoding restriction enzymes, Pl-Sce I and 
I-Ceu I, was isolated from pShuttle-pkGFP (bare) by Sap I and Dra III digestions 

25 followed by filling-in reaction. The pShuttle-pkGFP (bare) plasmid is 4126 bp in length, 
and contains a ColEl-Ori, a kanamycin resistance gene, plac, a LacZ promoter-GFPmut3- 
1 cds (Clontech), and a GFPmut3-l cds (Clontech). This cassette is subcloned into Srf I 
cut and blunted pXAdPan6-0-l, 9-100. This final construct is called pX-Pan6-pkGFP 
mu.0-1, 9-100, which is useful for generating recombinant El-deleted Pan 6 molecular 

30 clones carrying genes of interest by direct ligation and green/white selection in 
combination with the generic pShuttlepkGFP vectors. 



65 



WO 03/046124 PCTYUS02/3364S 

B. Alternative Strategy for Generation of Pan-6 Plasmid 
1. Cloning of 5 9 terminal fragment 

The Pan 6 virus is deproteinated by pronase and 
proteanase K treatment and phenol extraction as described above and synthetic 12 bp 
5 Pme I linkers are ligated onto the viral DNA as described. The AdPan6 5' Xbal fragment 
is isolated and ligated into pX to form pX-AdPan6-0-16.5 (9022 bp) as described in Part 
A above. 

2 Deletion of El from the 5 ' clone 

To delete El (m.u. 1.2-9), pX-AdPan6-0-16.5 is digested 
10 with SnaBI and Ndel to remove the regions encoding the El a and Elb proteins (3442- 
6310 bp). This vector is subsequently digested with BsiWI in preparation for blunting 
with the minigene cassette carrying a selective marker. 

3, Introduction of a selective marker 

A minigene cassette that expressed GFP under a lac 
15 promoter and which is flanked with recognition sites of rare intron encoding restriction 
enzymes, Pl-Xcel and I-Ceul, was isolated from pShuttle-pkGFP as described above. 
The DralH-SapI fragment is then ligated with the digested pX-AdPan6-0-16.5 to form 
P X-AdPan6 MU 0-16.5AE1 (7749 bp), 

4. Extension of Pan-6 Adenoviral Sequences 

20 pX-AdPan6 MU 0-16.5AE1 was subjected to Xbal 

digestion to permit insertion of an Xbal-RsrII linker. An Xbal/RsrII digestion fragment 
from the AdPan6 genome was isolated (mu 28-100, 26240 bp) and ligated into the 
Xba/RsrII-digested pX-AdPan6 MU 0-16.5AE1 to provide pX-AdPan6 MU 0-1, 9-16.5, 
28-100. A second Xbal fragment from the Pan6 genome (mu 1 6.5-28, 4350 bp) is then 

25 ligated into this plasmid to form pX-AdPan6 MU 0-1, 9-100 (38551 bp). 

C. Generation of Recombinant Adenoviruses 

To generate the recombinant adenoviruses from a El -deleted Pan6 
plasmid prepared as described in Parts A or b, the plasmid is co-transfected with a helper 
expressing El, or from an El-expressing packaging cell line, such as 293 cell line or a 
30 cell line prepared as described herein. The expression of El in the packaging cell permits 
the replication and packaging of the Pan6-pkGFP mu.0-1, 9-100 into a virion capsid. 
Alternatively, the packaging cell transfected with pX-Pan6-pkGFP mu.0-1, 9-100 is 
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transfected with an adenovirus vector as described above bearing another transgene of 
interest. 

Example 6 - Generation of Recombinant El -Deleted Pan7 Vector 
5 A. Generation of Pan 7 Plasmids 

A synthetic linker containing the restriction sites Pacl-Smal-Fsel- 
MluI-EcoRV-PacI was cloned into pBR322 that was cut with EcoRI and Ndel. The left 
end (bpl to 361 8) of Ad Pan7 was cloned into the linker between the Srnal and Fsel sites. 
The adenovirus El was then excised from the cloned left end by cutting with SnaBI and 

10 Ndel and inserting an I-CeuI-GFP-Pl-Scel cassette from pShuttle (Clontech) in its place. 
The resulting piasmid was cut with Fsel and Mlul and Ad Pan7 fragment Fsel (bp 361 8) 
to Mlul (bp 1551 14 was inserted to extend the left end. The construct (pPan7pGFP) was 
completed by inserting the 21421 bp Ad Pan7 right end fragment from the Mlul site (bp 
15114) into the above piasmid between Mlul and EcoRV to generate a complete 

15 molecular clone of El deleted adenovirus Pan7 suitable for the generation of recombinant 
adenoviruses. Optionally, a desired transgene is inserted into the I-Ceul and Pl-Scel sites 
of the newly created pPan7 vector piasmid, 

A Construction of El -Deleted Pan 7 Viral Vectors 

To generate the recombinant adenoviruses from pPan7AEl, the 

20 piasmid is co-transfected with a helper expressing El, or from an El-expressing 

packaging cell line, such as 293 cell line or a cell line prepared as described herein. The 
expression of El in the packaging cell permits the replication and packaging of the 
Pan7AEl into a virion capsid. In another embodiment, the packaging cell transfected 
with pX'Pan7 AE1 is transfected with an adenovirus vector as described above bearing 

25 the transgene of interest. Homologous recombination occurs between the helper and the 
piasmid, which permits the adenovirus-transgene sequences in the vector to be replicated 
and packaged into virion capsids, resulting in the recombinant adenovirus. Transfection 
and purification is as described above. 

30 Example 7 - Generation of Piasmid Vectors Expressing the El Genes 

Piasmid vectors are constructed which encode the Pan5 El region gene, and these 
plasmids are used to generate stable cell lines expressing viral El proteins. 
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Hie El region of Pan5 is cloned into pX\ essentially. as described in Example 4 
above, prior to replacement of this region with the fragment from pShuttle (Clontech). 
The expression plasmid contains the Pan5 adenoviral genome sequence spanning at least 
bp 1 to 3959 in the Pan5 genomic sequence. Thus, the expression plasmid contains the 
5 sequence encoding Fla and Elb of chimpanzee Ad Pan5 under the control of a 

heterologous promoter. Similar expression plasmids can be generated using the Ad Pan6 
and AdPan 7 El regions, identified in the tables above. 

Example 8 - Generation of Cell Lines Expressing Chimpanzee Adenovirus El Proteins 

1 0 Cell lines expressing viral El proteins are generated by transfecting HeLa (ATCC 

Acc. No. CCL2) with the plasmid of Example 6. These cell lines are useful for the 
production of El -deleted recombinant chimpanzee adenoviruses by co-transfection of 
genomic viral DNA and the expression plasmids described above. Transfection of these 
cell lines, as well as isolation and purification of recombinant chimpanzee adenoviruses 

15 therefrom are performed by methods conventional for other adenoviruses, i.e., human 
adenoviruses [see, e.g., Horwitz, cited above and other standard texts], 
A. Cell lines expressing Pan5 El proteins 

HeLa cells in 10cm dishes are transfected with 10 \ig of pX-Pan51-El 
DNA using a Cellphect™ kit (Pharmacia, Uppsala, Sweden) and following the 

20 manufacturer's protocol. 22 hours post-transfection, the cells are subjected to a three 
minute glycerol shock (15% glycerol in Hepes Buffered Saline, pH 7.5) washed once in 
DMEM (HeLa) or F12K (A549; Life Technologies, Inc., Grand Island, NY) media 
supplemented with 10% FCS, 1% Pen-Strep, then incubated for six hours at 37°C in the 
above described media. The transfected cells are then split into duplicate 15cm plates at 

2 5 ratios of 1 :20, 1 :40, 1 :80, 1 : 1 60, and 1 :320. Following incubation at 37°C overnight, the 
media is supplemented with G41 8 (Life Technologies, Inc.) at a concentration of 1 jig/ml. 
The media is replaced every 5 days and clones are isolated 20 days post-transfection, 

HeLa El cell clones are isolated and assayed for their ability to augment 
adeno-associated virus (AAV) infection and expression of recombinant LacZ protein as 

30 described below. 
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B. AAV Augmentation Assay for Screening El Expressing Cell 

Lines 

AAV requires adenovirus-encoded proteins in order to complete 
its life cycle. The adenoviral El proteins as well as the E4 region-encoded ORF6 protein 
5 are necessary for the augmentation of AAV infection. An assay for El expression based 
on AAV augmentation is used. Briefly, the method for identifying adenoviral El- 
expressing cells comprises the steps of infecting in separate cultures a putative adenovirus 
El-expressing cell and a cell containing no adenovirus sequence, with both an adeno- 
associated virus (AAV) expressing a marker gene and an AAV expressing the ORF6 of 

10 the E4 gene of human adenovirus, for a suitable time. The marker gene activity in the 
resulting cells is measured and those cells with significantly greater measurable marker 
activity than the control cells are selected as confirmed El -expressing cells. In the 
following experiment, the marker gene is a lacZ gene and the marker activity is the 
appearance of blue stain. 

1 5 For example, the cell lines described above, as well as 

untransfected control cells (HeLa) are infected with 100 genomes per cell of an AAV 
vector bearing a marker gene, e.g., AVLacZ [K. Fisher et al, J. Virol., 70:520 (1996)] 
and an AAV vector expressing the ORF6 region of human 5 (AV.orfS). The DNA 
sequence of the plasmid generates a novel recombinant adeno-associated virus (rAAV) 

2 0 containing the LacZ transgene and the Ad E4 ORF 6, which is an open read ing frame 
whose expression product facilitates single-stranded (ss) to double-stranded (ds) 
conversion of rAAV genomic DNA. These vectors are incubated in medium containing 
2% FCS and 1% Pen-Strep at 37°C for 4 hours, at which point an equal volume of 
medium containing 10% FCS is added. It should be understood by one of skill in the art 

2 5 that any marker gene (or reporter gene) may be employed in the first AAV vector of this 
assay, e.g., alkaline phosphatase, luciferase, and others. An antibody-en2yme assay can 
also be used to quantitate levels of antigen, where the marker expresses an antigen. The 
assay is not limited by the identity of the marker gene. Twenty to twenty-four hours 
post-infection, the cells are stained for LacZ activity using standard methods. After 4 

30 hours the cells are observed microscopically and cell lines with significantly more blue 
cells than the A549 or HeLa cell controls are scored as positive. 
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Example 9 - Delivery of Transgene to Host Cell 

The resulting recombinant chimpanzee adenovirus described in Example 4, 5 or 6 
above is then employed to deliver the transgene to a mammalian, preferably human, cell. 
For example, following purification of the recombinant virus, human embryonic kidney 
5 293 cells are infected at an MOI of 50 particles per cell. GFP expression was 
documented 24 hours post-infection. 

A. Gene Transfer in Mouse Models via Pan-6, Pan-7, and Pan-9 vectors 
Gene transfer efficiencies and toxicological profile of recombinant 

chimpanzee adenoviruses were compared in mouse liver directed gene transfer, mouse 

10 lung directed gene transfer, and mouse muscle directed gene transfer. 

El -deleted adenoviral vectors containing LacZ under the control of the 
CMV promoter were constructed using the techniques herein for human Ad5, chimpanzee 
Pan 6, chimpanzee Pan 7 and chimpanzee Pan 9 (C68). The vectors were delivered to 
immune-deficient NCR nude mice (80 for each study) as follows. For the liver study, 100 

15 ill (1 xlO 11 particles) were injected into the tail vein. For the lung study, 50 jil (5 xl0 !0 
particles) were delivered intratracheal ly. For the muscle study, 25 \i\ (5 xl 0 l0 particles) 
were injected into tibialis anterior. The mice were sacrificed on days 3, 7, 14 and 28 post- 
vector injection (5 animals per group at each time point). At each necropsy, the 
liver/lung/muscle tissue was harvested and prepared for cryoblocks and paraffin 

20 embedding. The cryoblocks were sectioned for X-gal staining and the paraffin sections 
are H&E stained for histopathic analysis. At each time point, terminal bleeding was 
performed. Serum samples were subjected to liver function tests. 

It was observed in this experiment the chimpanzee adenoviruses Pan-6, Pan-7, 
and Pan-9 were less efficient than huAd5 in gene transfer to the liver and to the lung. 

25 However, this may be desirable in certain circumstances, to reduce liver toxicity observed 
for huAdS. The gene transfer efficiency in muscle varied less between serotypes. 

B. Mouse study to feasibility of re-administration of adenovirus vectors by 
serotype switching between AdhuS, Pan-6, Pan-7, and Pan-9 vectors 

Mice were administered (C57/B16; 4/group) LacZ vectors based on 
30 huAd5, Pan-6, Pan-7, and Pan-9 (H5.040CMVLacZ, Pan6.000CMVLacZ, 

Pan7.000CMVLacZ, Pan9.000CMVLacZ; 10 n particles/injection) by tail vein. Thirty 
days later the mice were re-administered adenovirus vectors expressing al -antitrypsin 
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(H5.040CMVhAlAT, Pan6.000CMVhAlAT, 1x10" particles, Pan7.000CMVhAlAT, 
Pan9.000CMVhAlAT, 10 n particles/injection). Successful transduction by the re- 
administered vector is monitored by measuring serum al-antitrypsin on days 3 and 7, 
following re-administration. 

The ability of adenovirus vectors based on huAd5, Pan-6, Pan-7, and Pan- 
9 respectively to tnusduce the livers of mice in the presence of neutralizing antibodies to 
the other serotypes was determined. The results are tabulated here. 



1^ Itlld^tiAn 

i injection 


z injection 


Cross-neutralization 


Adhu5 


AdhuS 


Yes (+ve control) 




Pan-6 


No 




Pan-7 


No 




Pan-9 (C68) 


No 


Pan-6 


AdhuS 


No 




Pan-6 


Yes (+ve control) 




Pan-7 ■ 


Yes 




Pan-9 (C68) 


No 


Pan-7 


Adhu5 


No 




Pan-6 


Yes 




Pan-7 


Yes (+ve control) 




Pan-9 (C68) 


Yes 
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I st injection 


z injection 


Cross-neutralization 


Pan-9 (C68) 


Adhu5 


No 




Pan-6 


No 




Pan-7 


Yes 




Pan-9 (C68) 


Yes (+ve control) 



Ability of vectors to transduce murine liver in the presence of neutralizing 
antibodies to other serotypes. 
5 Thus, immunization with huAdS does not prevent re-administration with either of 

the chimpanzee adenovirus vectors Pan-6, Pan-7, or Pan-9 (C68). This experiment also 
appears to indicate that Pan-7 is between Pan-6 and Pan-9 in the spectrum of antigenic 
relatedness and cross-reacts with both; however Pan-6 and Pan-9 do not neutralize each 
other. This is a surprising result based on homology comparisons, which indicates that 
10 Pan-6 is quite distinct from Pan-7 and Pan-9. Evaluation of antisera generated against 
Pan-9 indicated no cross-neutralization against Pan-6 but some neutralization against Pan- 
7, arguing that Pan-6 is distinct from the others. 

1 5 Example 1 0 - Generation of Recombinant El -Deleted SV-25 Vector 

A plasmid was constructed containing the complete SV-25 genome except for an 
engineered El deletion. At the site of the El deletion recognition sites for the restriction 
enzymes I-Ceul and Pl-Scel which would allow insertion of transgene from a shuttle 
plasmid where the transgene expression cassette is flanked by these two enzyme 

20 recognition sites were inserted. 

A synthetic linker containing the restriction sites Swal-SnaBI-Spel-Aflll-EcoRV- 
Swal was cloned into pBR322 that was cut with EcoRI and Ndel. This was done by 
annealing together two synthetic oligomers SV25T (5'-AAT TTA AAT ACG TAG CGC 
ACT AGT CGC GCT AAG CGC GGA TAT CAT TTA AA-3\ SEQ ID NO: 38) and 

25 SV25B (5'-TAT TTA AAT GAT ATC CGC GCT TAA GCG CGA CTA GTG CGC 
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TAC GTA TTT A-3\ SEQ ID NO:39) and inserting it into pBR322 digested with EcoRI 
and NdeL The left end (bpl to 1057, SEQ ID NO:29) of Ad SV25 was cloned into the 
above linker between the SnaBI and Spel sites. The right end (bp28059 to 31042, SEQ 
ID NO: 29) of Ad SV25 was cloned into the linker between the Aflll and EcoRV sites. 
5 The adenovirus El was then excised between the EcoRI site (bp 547) to Xhol (bp 203 1) 
from the cloned left end as follows. A PCR generated 1-CeuI-PI-SceI cassette from 
pShuttle (Clontech) was inserted between the EcoRI and Spel sites. The 10154 bp Xhol 
fragment of Ad SV-25 (bp2031 to 12185, SEQ ID NO:29) was then inserted into the Spel 
site. The resulting plasmid was digested with Hindlll and the construct (pSV25) was 

10 completed by inserting the 18344 bp Ad SV-25 Hindlll fragment (bpl 1984 to 30328, 
SEQ ID NO:29) to generate a complete molecular clone of El deleted adenovirus SV25 
suitable for the generation of recombinant adenoviruses. Optionally, a desired transgene 
is inserted into the I-Ceul and Pl-Scel sites of the newly created pSV25 vector plasmid. 
To generate an AdSV25 carrying a marker gene, a GFP (green fluorescent 

15 protein) expression cassette previously cloned in the plasmid pShuttle (Clontech) was 
excised with the restriction enzymes I-Ceul and Pl-Scel and ligated into pSV25 (or 
another of the Ad chimp plasmids described herein) digested with the same enzymes. 
The resulting plasmid (pSV25GFP) was digested with Swal to separate the bacterial 
plasmid backbone and transfected into the El complementing cell line HEK 293. About 

20 10 days later, a cytopathic effect was observed indicating the presence of replicative 

virus. The successful generation of an Ad SV25 based adenoviral vector expressing GFP 
was confirmed by applying the supernatant from the transfected culture on to fresh cell 
cultures. The presence of secondarily infected cells was determined by observation of 
green fluorescence in a population of the cells, 

25 

Example 1 1 - Construction of E3 deleted Pan-5, Pan-6, Pan-7 and C68 vectors 

In order to enhance the cloning capacity of the adenoviral vectors, the E3 region 
can be deleted because this region encodes genes that are not required for the propagation 
of the virus in culture. Towards this end, E3-deleted versions of Pan-5, Pan-6, Pan-7, and 
30 C68 have been made (a 3.5 kb Nru-Avrll fragment containing E3 1-9 is deleted). 
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A . E3 deleted Pan5 based vector 

El -deleted pPan5-pkGFP plasmid was treated with Avr II endonuclease to 
isolate a 5.8 kb fragment containing the E3 region and re-circulate pPanS-pkGFP with 
Avr II deletion to form construct pPan5-pkGFP-E3-Avr II. Subsequently, the 5.8 kb Avr 
5 II fragment was subcloned into pSL-Pan5-E3-Avr II for a further deletion of E3 region by 
Nru I digestion. This led to a plasmid pSL-Pan5-E3-deletion. The final construct pPan5- 
E3-pkGFP was produced by removing a 4.3 kb Avr II/Spe I fragment from pSL-Pan5-E3- 
deletion plasmid and inserting into pPan5-pkGFP-E3-Avr II at Avr II site. In this final 
construct, a 3.1 kb deletion in E3 region was accomplished. 
10 B. E3 deletion in Pan6 based vector 

El-deleted pPan6- pkGFP molecular clone was digested with Sbf I and 
Not I to isolate 19.3 kb fragment and ligated back at Sbf I site. The resulting construct 
pPan6-Sbf I-E3 was treated with Eco 47 III and Swa I, generating pPan6-E3. Finally, 21 
kb Sbf 1 fragment from Sbf I digestion of pPan6- pkGFP was subcloned into pPan6-E3 to 
15 create pPan6-E3-pkGFP with a 4 kb deletion in E3. 

C. E3 deleted Pan? and Pan9 vectors 

The same strategy was used to achieve E3 deletions in both vectors. First, 
a 5.8 kb Avr II fragment spanning the E3 region was subcloned pSL-1 180, followed by 
deletion of E3 by Nru I digestion. The resulting plasmids were treated with Spe I and Avr 
20 II to obtain 4.4 kb fragments and clone into pPan7- pkGFP and pPan9-pkGFP at Avr II 
sites to replace the original E3 containing Avr II fragments, respectively. The final pPan7- 
E3- pkGFP and pPan9-E3- pkGFP constructs have 3.5 kb E3-deletions. 

Example 12 - Construction of E3- and E4-deleted Pan-7 vector 
25 Although the deletion of the El region of adenoviruses (first generation 

adenovirus vectors) renders them replication-incompetent, expression of the adenoviral 
vector backbone genes is not fully abolished. Deletion of the E4 region considerably 
attenuates this residual gene expression and may confer a safety advantage. An E4- 
deleted Pan-7 vector containing a 2.5 kb deletion (a PvuII-Agel fragment containing 
30 E40RF1 -ORF7 is deleted) has been constructed. High titer stocks of this virus were 
generated using a HEK 293-based cell line, which in addition to El y expresses an 
essential E4 gene (orf 6). 
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1 . E4 deletion in the molecular clone of Pan! 

A 1 9 kb Xba I fragment was deleted from pPan7- pkGFP to create pPan7- 
Xba I from which a 2.5 kb E4 fragment was deleted by Age I and Pvu II partial digestion, 
resulting in pPan7-Xba I-E4. pPan7-E4- pkGFP piasmid was generated from pPan7-Xba 
5 I-E4 in two sequential cloning steps, adding 19 kb Xba I and 1 5 kb I-Ceu I/MIu 

I fragments, both of which came from pPan7- pkGFP construct. 

2. Introduction of E3 and E4 deletions in Pan9 vector 

A 1 1 kb piasmid, pPan9-EcoRI, containing E4 region was created by 
retrieving 1 1 kb EcoRI fragment from pPan9 pkGFP after EcoRI digestion and self- 
10 ligation. E4 region was deleted from this construct by Age I digestion/filled in and Pvu 

II partial digestion and slef- ligation to generate pPan9-EcoR I-E4. A 23 kb 
EcoR I fragment was isolated from pPan9-pkGFP and inserted into pPan9-EcoR 
I-E4 at EcoR I site, followed by adding 5.8 kb Avr II fragment from 
pPan9-pkGFP, to form the final product pPan9-E3-E4- pkGF. Compared to the 

1 5 genome size of wild type Pan9, this E 1 -E3-E4-deleted vector could have a 
transgene capacity up to 8 kb. 

3. Introduction ofminigene cassettes with genes of interest including 
reporter genes, glyco- and nuclear proteins of Ebo into molecular clones of Pan 
vectors 

20 A highly efficient direct cloning and green/white selection procedure was 

employed for creating molecular clones of recombinant viruses. Briefly, genes of interest 
were cloned into pShuttlepkGFP by screening white colonies for recombinants. 
Subsequently, the minigene cassettes were transferred into chimpanzee 
adenovirus backbone plasmids, pPanX-pkGFP with various deletions, easily by 

25 swapping with pkGFP cassette at I-Ceu I and Pl-Sce I sites and screening a 
few white colonies for correct recombinants. 

4. Rescue of molecular clones of Pan vectors with multiple deletions in 
early regions and virus propagation 

For rescue of El-E3-deIeted molecular clones of chimpanzee adenovirus 
30 vectors, the clones were linearized with appropriate restriction enzymes and transfected 
into regular 293 cells. Once a full cytopathic effect (CPE) observed in the transfected 
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cells, crude lysale was harvested and expanded in 293 cells to large-scale infections. The 
viruses were purified by CsCl sedimentation method. 

For E1-E4 and El-E3-E4-deleted Pan vectors, 10-3 cells, a 293-based 
El-E4-complementingcell line, were used for rescue and propagation of vectors. E4 
5 ORF6 gene expression in 10-3 cells was induced by addition of 1 50 uM ZnSO 4 to the 
culture medium. 

Example 13 - Vaccination with adenovirus vectors expressing wild type and variant 
EboZ GP. 

10 AdHuS or AdC7 vectors expressing Ebola envelope chimeras were 

produced for in vivo immunization experiments in C57BL/6 mice. Recombinant 
viruses with different viral backbones were created by molecular cloning method 
in which the minigene cassettes were inserted into the place of El -deletions. The 
molecular clones of all recombinant viruses were rescued and grown up in 293 

1 5 cells for large-scale purification using CsCl sedimentation method. 

Five EboZ variants encoded by AdHu5 or AdPan7 (C7) were selected and 
produced to evaluate their relative immunogenicity following an intramuscular 
Ad injection. The wt Ebo, a soluble Ebo variant, EboAl, EboA2, EboA3, EboA4, 
EboA5S, EboA6S, EboA7S and EboA8S were evaluated in the initial vaccine 

2 0 studies. For the data summarized in the following table, the number of viral 
particles (per ml or total) produced and amplified from infected 293 cells was 
established by spectrophotometry reading. 
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Table: Production of Adhu5 or AdC7 Adenoviral vector encoding EboZ variant. 





HuAd5 




AdC7 




Gene 


Titer 

(VPx 

10 12 /ml) 


Total 
yield 
(VPx 
10 n ) 


Titer 

(VPx 

10 l2 /ml) 


Total 
yield 
(VPx 
10 12 ) 


Ebo wt 


2.6 


12 


4.3 


43 


EboS 


4.9 


49 


4.6 


55 


EboA2 


2.1 


9 


5.8 


93 


EboA3 


1.7 


8 


5.3 


95 


EboA4 


3 


12 


4.1 


62 



Vector was administered intramuscularly (10 n genome copies/cell) in C57BL/6 
5 mice and the presence of virus neutralizing antibody (VNAO was evaluated 28 days later 
as a first measure of an immune response generated against the Ebola envelope 
glycoprotein. VNA is defined here as serum antibody able to inhibit transduction of HeLa 
cells mediated by HIV-based vector pseudotyped with the wild-type Ebola envelope. 
VNA to the EboZ pseudotypes was detected with AdPan7 (C7) yielding higher 
10 titers than AdHu5. The EboZA3 elicited the highest VNA in terms of the transgene 
targets. For the data summarized in the following table, neutralizing antibody titers to 
HTV-EboZ-GFP pseudotypes (reciprocal dilution) are provided (N=5 animals/group). 

VNA Titers 

EboZ wildtype EboZs EboZA3 
AdHu5 12 16 12 

AdC7 44 12 140 
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Example 14 - Pan7-raediated Expression of Ebola Proteins 

Mouse studies to evaluate Pan-7 vectors expressing Ebola envelope proteins and 
the Ebola nuclear antigen have been initiated. These are directed towards evaluation of 
neutralizing antibodies in C57B1/6 mice injected intramuscularly (IM) with Adhu5 or 
5 Pan-7 expressing each of 4 Ebola env constructs. 

A. Evaluation o/CTL from C57BU6 mice injected IMwith AdhuS or Pan- 
7 expressing the Ebola env constructs. 

1 . Challenge experiment in mice with Ebola vims. 

Neutralizing antibody (NAB) responses to the Ebola envelope 

10 were analyzed by looking at immunized mouse sera mediated neutralization of a lenti viral 
(HIV) vector pseudotyped with the several constructs (eEbo, NTD2, NTD3, NTD4) of the 
Ebola envelope glycoprotein. C57BL/6 or BALB/c mice received a single intramuscular 
injection of 5 x 10 10 particles per mouse of C7 (Ad Pan-7) encoding Ebola envelope 
variant Neutralizing antibody was evaluated 30 days post-vaccination. Briefly, Ebola 

1 5 Zaire pseudotyped HIV vector encoding for B-galactosidase (EboZ-HIV-LacZ) was 
incubated for 2 hr at 37°C with different dilution of heat inactivated mouse serum.. 
Following the incubation with serum, EboZ-HI V-LacZ was then used to infect HeLa cells 
for 16 hr at 37°C. Infectivity was revealed by X-gal staining of transduced HeLa cells 
positive for B-galactosidase. Neutralizing titer represent the serum reciprocal dilution 

20 where a 50% decrease in the number of B-galactosidase positive blue cells was observed. 
Sera were collected 30 days post-immunization, which consisted in a single intramuscular 
(I.M.) administration of 5 x 10 10 particles/animal. Neutralizing antibody to Ebola 
pseudotyped HIV could be detected from all groups with antibody titers ranging from 20 
for Ad-EboZ (AdhuS expressing EboZ), Ad-NTD3 (AdhuS expressing NTD3) and C7- 

25 sEbo (Ad Pan-7 expressing soluble EboZ) to over 130 for C7-NTD3 (Ad Pan-7 

expressing soluble NTD3) and C7-NTD4 (Ad Pan-7 expressing soluble NTD3). The same 
immunization strategy in BALB/c mice resulted in lower neutralizing antibody titers for 
Ad- and C7-NTD2, and NTD4. 

B. Cellular Immune Response 

30 The cellular immune response to the Ebola envelope in C57BL/6 mice 

was evaluated 8 days after a single I.M. administration of 5 x 10 10 particles of C7-LacZ or 
C7-Ebola envelope variant per animal. Mice were vaccinated LM. with 5 x 10 10 particles 
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i 



of C7 encoding LacZ or Ebola envelope variant. Splenic lymphocytes from immunized 
mice were collected 8 days post vaccination and stimulated in vitro with feeder cells 
(splenic lymphocytes from untreated mice infected with human Adenovirus serotype 5 
encoding for the wild-type Ebola envelope and irradiated). Standard 5-hr CTL assays 
5 were performed using 5l Cr-labeled syngenic C57 cells transfected with an expressor of 
EboZ. 

A positive MHC-restricted cytotoxic T lymphocyte (CTL) response was observed 
from all AdPan-7 encoding for Ebola envelope variants with a higher response from 
NTD2, NTD3 or NTD4 immunized mice. Indeed, effector cells from C7 encoding Ebola 

10 envelope variant immunized mice recognized EboZ transfected target cells and gave 

recall CTL responses up to 30% specific lysis. Less than 5% lysis was seen with effector 
cells from naTve or LacZ immunized control mice confirming that lysis was specific for 
Ebola envelope antigens. 

C Protection Studies 

15 The most direct means of evaluating C7 (Ad Pan-7) encoding for the 

EboZ variants as a successful vaccine in mice was to assess protection against weight loss 
and death following lethal challenge with mouse adapted Ebola Zaire virus. BALB/c mice 
were immunized with a single dose of 5 x 10 10 particles per animal as performed 
previously and vaccinated animals were challenged with 200 LD 50 of mouse adapted 

20 Ebola Zaire 21 days later. All control mice (vehicle and C7-LacZ) died between day 5 to 
day 9 post-challenge. In contrast, all vaccinated mice but one, (from the C7-sEbo group), 
survived the challenge with Ebola Zaire. 

Weight loss was observed from mice vaccinated with C7-sEbo from day 4 to day 
7. Signs of illness such as pilo-erection and from light to severe lethargy were also noted 

25 from mice vaccinated with C7-sEbo, NTD2 and NTD3 between day 4 to day 7. Mice 
immunized with C7-EboZ and C7-NTD4 did not show sign of illness. Overall, a single 
dose of C7-EboZ and C7-NTD4 completely protected immunized mice from illness and 
death possibly due to a significant T cell mediated immunity. 
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All documents recited above are incorporated herein by reference. Numerous 
modifications and variations of the present invention are included in the scope of the 
above-identified specification and are expected to be obvious to one of skill in the art. 
5 Such modification? and alterations to the compositions and processes of the present 
invention, such as selections of different minigenes or selection or dosage of the 
vectors or immune modulators are believed to be within the scope of the claims 
appended hereto. 



80 



WO 03/046124 



PCTAJS02/33645 



WHAT IS CLAIMED IS: 



1. An isolated simian adenovirus nucleic acid sequence selected from the 

group consisting of: 

(a) Pan5 having the sequence of nucleic acids 1 to 36462 of SEQ ID 

NO:l; 

(b) Pan6 having the sequence of nucleic acids 1 to 36604 of SEQ ID 

NO:5; 

(c) Pan7 having the sequence of nucleic acids 1 to 36535 of SEQ ID 

NO:9; 

(d) SV1 having the sequence of nucleic acids 1 to 34264 of SEQ ID NO: 

24; 

(e) SV25 having the sequence of nucleic acids I to 31044 of SEQ ID NO: 

29; 

(f) SV39 having the sequence of to nucleic acids 1 to 341 1 5 of SEQ ID 

NO: 34, and 

(g) a nucleic acid sequence complementary to the sequence of any of (a) 

to(f). 

2, An isolated simian adenovirus serotype nucleic acid sequence selected 
from one or more of the group consisting of: 

(a) 5 ' inverted terminal repeat (UR) sequences; 

(b) the adenovirus El a region, or a fragment thereof selected 
from among the 13S, 12S and 9S regions; 

(c) the adenovirus Elb region, or a fragment thereof selected 
from among the group consisting of the small T, large T, DC, and IVa2 regions; 

(d) the E2b region; 

(e) the LI region, or a fragment thereof selected from among 
the group consisting of the 28.1 kD protein, polymerase, agnoprotein, 52/55 kD 
protein, and Ula protein; 
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(f) the L2 region, or a fragment thereof selected from the group 
consisting of the penton, VII, VI, and Mu proteins; 

(g) the L3 region, or a fragment thereof selected from the group 
consisting of the VI, hexon, or endoprotease; 

(h) the 2a protein; 

(i) the L4 region, or a fragment thereof selected from the group 
consisting of the 100 kD protein, the 33 kD homolog, and VIE; 

(j) the E3 region, or a fragment thereof selected from the group 
consisting of E3 ORFi; E3 ORF2, E3 ORF3, E3 ORP4, E3 ORF5, E3 ORF6, E3 
ORF7, E3 ORF8, and E3 ORF9; 

(k) the L5 region, or a fragment thereof selected from a fiber 

protein; 

(1) the E4 region, or a fragment thereof selected from the group 
consisting of E4 ORF?, E4 ORF6, E4 OKF4, E4 ORF3, E4 ORF2, and E4 ORFl;and 

(m)the3'ITR, 

of any of Pan5 SEQ ID NO:l; Pan6 SEQ rD NO:5; Pan7 SEQ ID NO:9; 
SV1 SEQ ID NO: 24; SV25 SEQ ID NO: 29; and SV39 SEQ ID NO: 34, or sequence 
complementary to any of (a) to (m). 

3. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 2. 

4. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 2. 

5. The nucleic acid molecule according to claim 4, wherein said simian 
adenoviral sequence encodes an adenoviral gene product and is operatively linked to 
regulatory control sequences which direct expression of the adenoviral gene product in a 
host cells. 
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6. The nucleic acid molecule according to claim 4 or 5, wherein said simian 
adenoviral sequence comprises the El a region of Pan5 SEQ ID NO':l; Pan6 SEQ ID 
NO:5; Pan7 SEQ ID NO:9; SV1 SEQ ID NO: 24; SV25 SEQ ID NO: 29; and SV39 SEQ 
ID NO: 34. 

7. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 6 and a physiologically compatible carrier. 

8. An isolated simian adenoviral capsid protein selected from the group 
consisting of: 

(a) a hexon protein of Pan5 SEQ ID NO:3, Pan6 SEQ ID NO:7, 
Pan7 SEQ ID NO:l 1, SV1 SEQ ID NO:26, SV25 SEQ ID NO:31or SV39 SEQ ED 
NO:36, or fragment thereof; 

(b) a penton protein of Pan5 SEQ ID NO:2, Pan6 SEQ ID 
NO:6, Pan7 SEQ ID NO: 10, SV1 SEQ ID NO:25, SV25 SEQ ID NO: 30 or SV39 
SEQIDNO:35; 

(c) a fiber protein of Pan5 SEQ ID NO:4, Pan6 SEQ ID NO:8, 
Pan7 SEQ ID NO: 12, SV1 SEQ ID NO: 27 and SEQ ID NO:28, SV25 SEQ ID NO: 
32 and SEQ ID NO:33 or SV39 SEQ ID NO: 37, or a fragment thereof. 

9. An artificial adenovirus serotype comprising a capsid protein 
according to claim 8 or a fragment thereof. 

10. The artificial adenovirus serotype according to claim 9, wherein said 
capsid comprises a fragment of the hexon protein selected from the group consisting 
of Pan5 SEQ ID NO: 15, Pan6 SEQ ID NO: 16 and Pan7 SEQ ID NO: 17. 

1 1 . The artificial adenovirus serotype according to claim 9 or 1 0, wherein 
said capsid comprises a fragment of the fiber protein selected from the group 
consisting of Pan6 SEQ ID NO: 19, Pan7 SEQ ID NO:20 and Pan5 SEQ ID NO:21. 
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12. A nucleic acid molecule comprising a heterologous sequence encoding 
a protein according to claims 3 or 8 or an artificial adenovirus serotype according to 
any of claims 9 to 11. 

13. A recombinant vector comprising a simian adenovirus sequence 
according to claim 2 or a nucleic acid molecule according to claim 4 or 12 and a 
heterologous gene operatively linked to sequences which direct expression of said 
gene in a host cell. 

14. The recombinant vector according to claim 1 3 , further comprising 5 ' 
and 3' adenovirus cis-elements necessary for replication and encapsidation. 

15. The recombinant vector according to claim 13 or claim 14, wherein 
said vector is a virus. 

1 6. The recombinant vector according to any of claims 1 3 to 1 5 wherein 
said vector lacks all or apart of the El gene. 

1 7. A recombinant vims comprising a simian capsid protein according to 
. claim 3 or an artificial adenovirus serotype according to any of claims 9 to 1 1. 

1 8. A host cell comprising a nucleic acid molecule according to any of 
claims 4 to 6 or 12 , a recombinant vector according to any of claims 13 to 16, or a 
recombinant virus according to claim 17. 

19. The host cell according to claim 18, wherein said host cell is stably 
transformed with the nucleic acid molecule or the recombinant vector. 
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20. The host cell according to claim 1 8 or claim 19, wherein said host cell 
expresses one or more adenoviral gene products from said nucleic acid molecule or 
recombinant vector, said adenoviral gene products selected from the group consisting 
of El a, Elb, E2a, and E4 ORF6. 

21 . The host cell according to any of claims 18 to 20, wherein said host 
cell is stably transformed with a nucleic acid molecule comprising the simian 
adenovirus inverted terminal repeats. 

22. A composition comprising a recombinant vector in a phannaceutically 
acceptable carrier, said vector comprising a simian adenovirus sequence according to 
any of claims 1 or 2 and a selected heterologous gene operatively linked to regulatory 
sequences which direct expression of said gene in a host cell. 

23. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the vector of any of 
claims 13 to 16 or a virus according to claim 17. 

24. A method for repeat administration of a heterologous gene to a 
mammal comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a virus according 
to claim 17 and wherein the first and second recombinant vector are different. 

25 . A method for producing a selected gene product comprising infecting a 
mammalian cell with the vector of any of claims 13 to 16 or a virus according to 
claim 17, culturing said cell under suitable conditions and recovering from said cell 
culture the expressed gene product 
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26. A method for eliciting an immune response in a mammalian host 
against an infective agent comprising administering to said host an effective amount 
of the recombinant adenovirus of claim 17, wherein said heterologous gene encodes 
an antigen of the infective agent. 

27. The method according to claim 26, comprising the step of priming the 
host with a DNA vaccine comprising the heterologous gene prior to administering the 
recombinant adenovirus . 

28. A composition comprising a simian adenovirus capsid protein 
according to claim 8 linked to a heterologous molecule for delivery to a selected host 
cell. 

29. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject & composition according to claim 28. 
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SEQUENCE LISTING 

<110> The Trustees of the University of Pennsylvania 
Wilson, James M. 
Gao, Guangping 
Roy, Soumitra 

<120> Simian Adenovirus Nucleic Acid and Amino Acid Sequences, 
Vectors Containing Same, and Methods of Use 

<130> UPN-02677PCT 

<150> US 60/331,951 
<151> 2001-11-21 

<150> US 60/366,798 
<151> 2002-03-22 

<160> 39 

<170> Patentln version 3..1 

<210> 1 

<211> 36462 

<212> DNA 

<213> chinqpanzee adenovirus serotype PanS 

<220> 

<221> CDS 

<222> (13898) . • (15490) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18315) • • (21116) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (32035) (33372) 

<223> L5 Fiber 

<400> 1 



catcatcaat 


aatatacctc 


aaacttttgg tgcgcgttaa tatgcaaatg aggtatttga 


60 


atttggggat 


gcggggcggt 


gattggctgc gggagcggcg accgttaggg gcggggcggg 


120 


tgacgttttg 


atgacgtggc 


cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 


180 


gacgtcaaac 


gaggtgtggt 


ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 


240 


ggaaatgagg 


tgtttctggg 


cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 


300 


aatgaggaag 


tgaaaatctg 


agtaattccg cgtttatggc agggaggagt atttgccgag 


360 


ggccgagtag 


actttgaccg 


attacgtggg ggtttcgatt accgtatttt tcacctaaat 


420 
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ttccgcgtac 


ggtgtcaaag 


tccggtgttt ttacgtaggt 


gtcagctgat 


cgccagggta 


480 


tttaaacctg 


cgctctctag 


tcaagaggcc actcttgagt 


gccagcgagt 


agagttttct 


540 


cctccgcgcc 


gcgagtcaga 


tctacacttt gaaagatgag 


gcacctgaga 


gacctgcccg 


600 


gtaatgtttt 


cctggctact 


gggaacgaga ttctggaact 


ggtggtggac 


gccatgatgg 


660 


gtgacgaccc 


tccggagccc 


cctaccccat ttgaagcgcc 


ttcgctgtac 


gatttgtatg 


720 


atctggaggt 


ggatgtgccc 


gagaacgacc ccaacgagga 


ggcggtgaat 


gatttgttta 


780 


gcgatgccgc 


gctgctggct 


gccgagcagg ctaatacgga 


ctctggctca 


gacagcgatt 


840 


cctctctcca 


taccccgaga 


cccggcagag gtgagaaaaa 


gatccccgag 


cttaaagggg 


900 


aagagctcga 


cctgcgctgc 


tatgaggaat gcttgcctcc 


gagcgatgat 


gaggaggacg 


960 


aggaggcgat 


tcgagctgca 


gcgaaccagg gagtgaaaac 


agcgagcgag 


ggctttagcc 


1020 


tggactgtcc 


tactctgccc 


ggacacggct gtaagtcttg 


tgaatttcat 


cgcatgaata 


1080 


ctggagataa 


gaatgtgatg 


tgtgccctgt gctatatgag 


agcttacaac 


cattgtgttt 


1140 


acagtaagtg 


tgattaactt 


tagctgggga ggcagagggt 


gactgggtgc 


tgactggttt 


1200 


atttatgtat 


atgtttttta 


tgtgtaggtc ccgtctctga 


cgtagatgag 


acccccacta 


1260 


cagagtgcat 


ttcatcaccc 


ccagaaattg gcgaggaacc 


gcccgaagat 


attattca'ta 


1320 


gaccagttgc 


agtgagagtc 


accgggcgta gagcagctgt 


ggagagtttg 


gatgacttgc 


1380 


tacagggtgg 


ggatgaacct 


ttggacttgt gtacccggaa 


acgccccagg 


cactaagtgc 


1440 


cacacatgtg 


tgtttactta 


aggtgatgtc agtatttata 


gggtgtggag 


tgcaataaaa 


1500 


tccgtgttga 


ctttaagtgc 


gtggtttatg actcaggggt 


ggggactgtg 


ggtatataag 


1560 


caggtgcaga 


cctgtgtggt 


cagttcagag caggactcat 


ggagatctgg 


acagtcttgg 


1620 


aagactttca 


ccagactaga 


cagctgctag agaactcatc 


ggagggagtc 


tcttacctgt 


1680 


ggagattctg 


cttcggtggg 


cctctagcta agctagtcta 


tagggccaag 


caggattata 


1740 


aggatcaatt 


tgaggatatt 


ttgagagagt gtcctggtat 


ttttgactct 


ctcaacttgg 


1800 


gccatcagtc 


tcactttaac 


cagagtattc tgagagccct 


tgacttttct 


actcctggca 


1860 


gaactaccgc 


cgcggtagcc 


ttttttgcct ttatccttga 


caaatggagt 


caagaaaccc 


1920 


atttcagcag 


ggattaccgt 


ctggactgct tagcagtagc 


tttgtggaga 


acatggaggt 


1980 


gccagcgcct 


gaatgcaatc 


tccggctact tgccagtaca 


gccggtagac 


acgctgagga 


2040 


tcctgagtct 


ccagtcaccc 


caggaacacc aacgccgcca 


gcagccgcag 


caggagcagc 


2100 
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aacaaaaoaa 


aaaccaaaaa 


gagaacctga 


gagceggtet 

3 3 v 3 3 


ggaccctccg 


ataacaaaacr 

3 "3 3 ^ 3 3 3 3 


2160 


acraacrcracrta 


actaacttgt 


ttcccgagct 


gcgccggqtg 

3 3 3 3 3 3 


ctgactaggt 


cttccagtgg 


2220 


acaacraaaaa 

w 3 3 3'"* 3 ** 3 3 


aaaattaaac 

333>** w ^ fc * s *3 w 


aagaaaaaca 

3 3 3 ** 3 *■ 3 3 


tqaqqaqact 

**3**33*"3"* w ^ 


agecacagaa 


ctgaactgac 


2280 


tgtcagtctg 


atgagtcgca 


ggcgcccaga 

33 3 3 


atcggtgtgg 

33 3 33 


tqgcatgaqg 

3 3 w 3 3 3 


tgeagtcgea 


2340 


gggqatagat 

3 3 3 3 7 


gaggtctcag 

3 3 3 — 3 


tgatgcatga 


gaaatattcc 


ctagaacaag 


tcaagacttg 


2400 


ttggttggag 

355 3 3 —3 


cccgaggatg 

3 3 3 3 


attgggaggt 

3 3 3 3 3 


agecatcagg 


aattatgeca 


agctggctct 


2460 


gaggccagac 

3 3 3 3 


aagaagtaca 


agattaccaa 


actgattaat 


atcagaaatt 


cctgctacat 


2520 


ttcagggaat 


acfacrcccracfa 

3 3 3 3 ^^"3 M 5? 


taaaaatcag 

** 3 3 ** 3 a *» w< * 3 


tacccaaaaa 


aaaatacrcct 


t cagat octq 


2580 


catgatgaat 


atgtacccgg 


333 *"3 3 **3 3 3 


cataaaaaaa 


gt caccttta 


tgaacgcgaa 

\,yaauy^.yay 


2640 


crttcaggagt 


qatqqqtata 

3 3 3 3 


atacragtaot 


etttatggee 


aacaccaagc 


tgacagtgca 


2700 


cggatgctcc 


ttctttggct 


tcaataacat 


qtgcattqaq 


acctagqaca 


gtgt 1 1 cagt 


2760 


gaggggatgc 

3 3 3 33^* ~3 


agtttttcag 


ccaactggat 


aggggtegtg 

33333^^3 w 3 


ggcagaacca 


agagcatggt 


2820 


gtcagtgaag 

3 3 3 ^3 


aaatgcctgt 


tcgagaqgtg 

^^3^3^33 3 


ccacctgggg 


gtgatgagcg 

3 w 3** w 3 3 w 3 


agggegaage 

u 33? 3 3 


2880 


caaagtcaaa 


cactgcgcct 


ctaccgagac 


gcractacttt 


gtactgatca 


aaaacaatac 


2940 


caaagtcaag 


cataatatga 


t ctgtggggc 


cteggatgag 


ccrcaactacc 


aaatcrctaac 


3000 




aaaaacaacc 

\if HpJ Np* W Vp* >J v^ w 


atatactacrc 


cacccrtacat 


ataacctcac 


acccccocaa 


3060 


aacataaccc 


aaattcaaac 


acaacatcat 

^4 \* V»* ^* W> ^5 WWW* W 


aacccactac 


aatatocacc 

W& U w WJ W WJ w%4 WW 


taaaatccca 


3120 


ccqaggcatg 

3 3 3 3 


ttcatgccct 


accagtgcaa 


catgeaattt 


gtgaaggtgc 


tqctaqaqcc 


3180 


cgatgccatg 


tccagagtga 


gcqtgacggg 


ggtgtttgac 


atgaatgtgg 


agctgtggaa 


3240 


aattctgaga 


tatgatgaat 


ccaagaccag 


gtgccgggcc 


tgcgaatgcg 


gaggcaagca 


3300 


cgccaggctt 


cagcccgtgt 


gtgtggaggt 

3 3 3 3 3 3 


gaeggaggae 

3 33 3 3 


ctgcgacccg 


atcatttggt 


3360 


gttgtcctgc 


aacacrcracacr 


agttcggctc 


caacaaaaaa 

3 ^3 3 33**** 


gaatctgact 


aaaataaata 


3420 


crtcrtttocrcfa 

N|j » » » 3 3 3 


ctcraat acf era 


erect a cat era 

y w 3 **** 3 


tofcrcrcacraat 


aactaaaatc 


tatatttttc 

w U W< WJ w L« WJ W> W> W* 


3480 


tacacaacaa 


cataaacaaa 


aacacctcct 

« %J W^ WJ W* WJ W W w- \m 




aatattcaac 

WJ SJ w w* w* w> w- u, w w> 


ccttatctoa 

w> wj w w a Wj w^ w> wj u 


3540 


caaaaccrtct 

w 3 3 3 3 ^3 w w 


cccctcctaa 


acaacracrt ac 


gtcagaatgt 


aataacratcc 


acaataaaca 


3600 


gccggcccgt 


gcagcccgcg 


aactcttcaa 


ccctgaccta 


cgcgaccctg 


agctcctcgt 


3660 


ccgtggacgc 


agctgccgcc 


geagctgetg 


cttccgccgc 


cagcgccgtg 


cgcggaatgg 


3720 


ccctgggcgc 


cggctactac 


agctctctgg 


tggccaactc 


gagttccacc 


aataatcccg 


3780 


ccagcctgaa 


cgaggagaag 


ctgctgctgc 


tgatggccca 


gctcgaggcc 


ctgacccagc 


3840 
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gcctgggcga gctgacccag caggtggctc agctgcaggc ggagacgcgg gccgcggttg 3900 

ccacggtgaa aaccaaataa aaaatgaatc aataaataaa cggagacggt tgttgatttt 3960 

aacacagagt cttgaatctt tatttgattt ttcgcgcgcg gtaggccctg gaccaccggt 4020 

ctcgatcatt gagcacccgg tggatctttt ccaggacccg gtagaggtgg gcttggatgt 4080 

tgaggtacat gggcatgagc ccgtcccggg ggtggaggta gctccattgc agggcctcgt 4140 

gctcgggggt ggtgttgtaa atcacccagt catagcaggg gcgcagggcg tggtgctgca 4200 

cgatgtcctt gaggaggaga ctgatggcca cgggcagccc cttggtgtag gtgttgacga 4260 

acctgttgag ctgggaggga tgcatgcggg gggagatgag atgcatcttg gcctggatct 4320 

tgagattggc gatgttcccg cccagatccc gccgggggtt catgttgtgc aggaccacca 4380 

gcacggtgta tccggtgcac ttggggaatt tgtcatgcaa cttggaaggg aaggcgtgaa 4440 

agaatttgga gacgcccttg tgaccgccca ggttttccat gcactcatcc atgatgatgg 4500 

cgatgggccc gtgggcggcg gcttgggcaa agacgtttcg ggggtcggac acatcgtagt 4560 

tgtggtcctg ggtgagctcg tcataggcca ttttaatgaa tttggggcgg agggtgcccg 4620 

actgggggac gaaggtgccc tcgatcccgg gggcgtagtt gccctcgcag atctgcatct 4680 

cccaggcctt gagctcggag ggggggatca tgtccacctg cggggcgatg aaaaaaacgg 4740 

tttccggggc gggggagatg agctgggccg aaagcaggtt ccggagcagc tgggacttgc 4800 

cgcagccggt ggggccgtag atgaccccga tgaccggctg caggtggtag ttgagggaga 4860 

gacagctgcc gtcctcgcgg aggagggggg ccacctcgtt catcatctcg cgcacatgca 4920 

tgttctcgcg cacgagttcc gccaggaggc gctcgccccc aagcgagagg agctcttgca 4980 

gcgaggcgaa gtttttcagc ggcttgagcc cgtcggccat gggcattttg gagagggtct 5040 

gttgcaagag ttccagacgg tcccagagct cggtgatgtg ctctagggca tctcgatcca 5100 

gcagacctcc tcgtttcgcg ggttggggcg actgcgggag tagggcacca ggcgatgggc 5160 

gtccagcgag gccagggtcc ggtccttcca ggggcgcagg gtccgcgtca gcgtggtctc 5220 

cgtcacggtg aaggggtgcg cgccgggctg ggcgcttgcg agggtgcgct tcaggctcat 5280 

ccggctggtc gagaaccgct cccggtcggc gccctgcgcg tcggccaggt agcaattgag 5340 

catgagttcg tagttgagcg cctcggccgc gtggcccttg gcgcggagct tacctttgga 5400 

agtgtgtccg cagacgggac agaggaggga cttgagggcg tagagcttgg gggcgaggaa 5460 

gacggactcg ggggcgtagg cgtccgcgcc gcagctggcg cagacggtct cgcactccac 5520 
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gagccaggtg 

gcgtttctta 

cgtgtccccg 

gtagaggaac 

cacgtgggag 

gcacatgtcc 

accgggggtc 

ttccggatcg 

catgacctcg 

gccgttggag 

gttgtcgagc 

gcgcatggtc 

gtactcgcgc 

tctgacccgc 

gcgcaggggc 

cgggtccagc 

ctcggggtcg 

cacggccagc 

cgcggaggcg 

gtaggtgggg 

cgagggcgcg 

gatctggcgg 

gtgggcgtgg 

ggcgacgagc 

gatgtcgtac 

gcggtccttc 

catgtagaac 

agcttgcgcg 

cttgaggaac 



aggtctggcc 

cctctggtct 

tagaccgact 

cccgcccact 

gggtagcggt 

ccctcgtcca 

ccggccgggg 

ctgtccagga 

gcactcaggt 

acgcctttca 

ttggtggcga 

tggttctttt 

gccacgcact 

cagccgcggt 

tcgttggtcc 

atgagctcgt 

aagtagctga 

gcgcgctcgt 

tacatgccgc 

tagcagcgcc 

aggagcccgg 

aagatggcgt 

ggcagtccga 

tcggcggtga 

ttgagctggc 

cagtactctt 

tggttgacgg 

gccttgcgca 

tggtgcttga 



ggtcggggtc 

ccatgagctc 

ttatgggccg 

ccgagacgaa 

cgttgtccac 

catccaggaa 

gggtataaaa 

gcgccagctg 

tgtcagtttc 

tgagcccctc 

aggagccgta 

ccttgtcggc 

tccattcggg 

tgtgcagggt 

agcagaggcg 

cgggggggtc 

tgcaggtgcc 

aggggctgag 

agatgtcgta 

ccccgcggat 

tgccgaggtt 

gggagttgga 

ccgagtccct 

cgaggacgtc 

ccttctgctt 

cgagggggaa 

ccttgtaggc 

gggaggtgtg 

agtcgaggtc 



aaaaacgagg 

gtgtccccgc 

gtcctcgagc 

ggcccgggtc 

cagcgggtcc 

ggtgattggc 

gggggcgggc 

ttggggtagg 

tagaaacgag 

gtccatctgg 

gagggcgttg 

gcgctccttg 

gaagacggtg 

gatgaggtcc 

cccgcccttg 

ggcgtccacg 

cagatcgtcc 

gggcgtgccc 

gacgtagagg 

gctggcgcgc 

ggagcgctgc 

ggagatggtg 

gatgaagtgg 

cagggcgcag 

ccacagctcg 

cccgtcctga 

gcagcagccc 

ggtgagggcg 

gtcgcagccg 



tttcctccgt 

tgggtgacaa 

ggggtgccgc 

caggccagca 

accttctcca 

ttgtaagtgt 

ccctgctcgt 

tattccctct 

gaggatttga 

tcagaaaaga 

gagagcagct 

gcggcgatgt 

gtgagcttgt 

acgctggtgg 

cgcgagcaga 

gtgaagatgc 

agcgccgctt 

cagggcatgg 

ggctcctcga 

acgtagtcgt 

ggcttttcgg 

ggcctctgga 

gcgtaggagt 

tagtcgaggg 

cggttgagaa 

tcggcacggt 

ttctccacgg 

aaggtgtcgc 

ccctgctccc 



gctttttgat 

agaggctgtc 

ggtcctcgtc 

cgaaggaggc 

gggtatgcaa 

aggccacgtg 

cctcactgtc 

cgaaggcggg 

tattgacggt 

cgatcttttt 

tggcgatgga 

tgagctgcac 

cgggcacgat 

ccacctcgcc 

aggggggcag 

cgggcaggag 

gccagtcgcg 

ggtgcgtgag 

ggacgccgat 

acagctcgtg 

cgcggtagac 

agatgttgaa 

cctgcagctt 

tctcttggat 

ggaactcttc 

aagagcccac 

ggagggcgta 

gcaccatgac 

agagctggaa 



5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
•6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
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gtccgtgcgc 


ttcttgtagg 


cggggttggg 


caaagcgaaa 


gtaacatcgt 


tgaagaggat 


7320 


cttgcccgcg 


cggggcatga 


agttgcgagt 


gatgcggaaa 


ggctggggca 


cctcggcccg 


7380 


gttgttgatg 


acctgggcgg 


cgaggacgat 


ctcgtcgaag 


ccgttgatgt 


tgtgcccgac 


7440 


gatgtagagt 


tccacgaatc 


gcgggcggcc 


cttgacgtgg 


ggcagcttct 


tgagctcgtc 


7500 


gtaggtgagc 


tcggcggggt 


cgctgaggcc 


gtgctgctcg 


agggcccagt 


cggcgaggtg 


7560 


ggggttggcg 


ccgaggaagg 


aagtccagag 


atccacggcc 


agggcggtct 


gcaagcggtc 


7620 


ccggtactga 


cggaactgct 


ggcccacggc 


cattttttcg 


ggggtgacgc 


agtagaaggt 


7680 


gcgggggtcg 


ccgtgccagc 


ggtcccactt 


gagctggagg 


gcgaggtcgt 


gggcgagctc 


7740 


gacgagcggc 


gggtccccgg 


agagtttcat 


gaccagcatg 


aaggggacga 


gctgcttgcc 


7800 


gaaggacccc 


atccaggtgt 


aggtttccac 


gtcgtaggtg 


aggaagagcc 


tttcggtgcg 


7860 


aggatgcgag 


ccgatgggga 


agaactggat 


ctcctgccac 


cagttggagg 


aatggctgtt x 


7920 


gatgtgatgg 


aagtagaaat 


gccgacggcg 


cgccgagcac 


tcgtgcttgt 


gtttatacaa 


7980 


gcgtccgcag 


tgctcgcaac 


gctgcacggg 


atgcacgtgc 


tgcacgagct 


gtacctgggt 


8040 


tcctttgacg 


aggaatttca 


gtgggcagtg 


gagcgctggc 


ggctgcatct 


ggtgctgtac 


8100 


tacgtcctgg 


ccatcggcgt 


ggccatcgtc 


tgcctcgatg 


gtggtcatgc 


tgacgaggcc 


8160 


gcgcgggagg 


caggtccaga 


cctcggctcg 


gacgggtcgg 


agagcgagga 


cgagggcgcg 


8220 


caggccggag 


ctgtccaggg 


tcctgagacg 


ctgcggagtc 


aggtcagtgg 


gcagcggcgg 


8280 


cgcgcggttg 


acttgcagga 


gcttttccag 


ggcgcgcggg 


aggtccagat 


ggtacttgat 


8340 


ctccacggcg 


ccgttggtgg 


cgacgtccac 


ggcttgcagg 


gtcccgtgcc 


cctggggcgc 


8400 


caccaccgtg 


ccccgtttct 


tcttgggtgc 


tggcggcggc 


ggctccatgc 


ttagaagcgg 


8460 


cggcgaggac 


gcgcgccggg 


cggcaggggc 


ggctcggggc 


ccggaggcag 


gggcggcagg 


8520 


ggcacgtcgg 


cgccgcgcgc 


gggcaggttc 


tggtactgcg 


cccggagaag 


actggcgtga 


6580 


gcgacgacgc 


gacggttgac 


gtcctggatc 


tgacgcctct 


gggtgaaggc 


cacgggaccc 


8640 


gtgagtttga 


acctgaaaga 


gagttcgaca 


gaatcaatct 


cggtatcgtt 


gacggcggcc 


8700 


tgccgcagga 


tctcttgcac 


gtcgcccgag 


ttgtcctggt 


aggcgatctc 


ggtcatgaac 


8760 


tgctcgatct 


cctcctcctg 


aaggtctccg 


cgaccggcgc 


gctcgacggt 


ggccgcgagg 


8820 


tcgttggaga 


tgcggcccat 


gagctgcgag 


aaggcgttca 


tgccggcctc 


gttccagacg 


8880 


cggctgtaga 


ccacggctcc 


gtcggggtcg 


cgcgcgcgca 


tgaccacctg 


ggcgaggttg 


8940 
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agctcgacgt 

gtggtggcga 

ctgacgtcgc 

aaaaactggg 

gcgatggtgg 

tcctcctctt 

gggggagggg 

gtctccccgc 

agcgtgaaga 

gagagggcgc 

agcgtctcga 

tcgcaaggta 

gcgatgctgc 

agcaccaggt 

tggtcctgac 

tcctcctcgc 

agcgccaggt 

gtctggaagt 

cagttggcca 

ttgaggcgcg 

tggtatccga 

ggggcgccgg 

atccaggtga 

atgttgcgca 

cagtcgtgga 

ggcctggagg 

gaatcaggct 

taacgaaacc 

gaaactagta 



ggcgcgtgaa 

tgtgctcggt 

ccagggcttc 

agttgcgcgc 

cgcgcacctc 

ccatctcctc 

ccctgcgtcg 

gccggcgacg 

cgccgccgcg 

tgacgatgca 

gatccacggg 

ggctgagccc 

tggtgatgaa 

ccttgggccc 

acctggcgag 

ccgcgcggcc 

cggcgacgac 

cgtcgaagtc 

tgacggacca 

agtaggcgcg 

cgaggaagtg 

gcgcgaggtc 

tgccggcggc 

gcggcaggaa 

tgctctagac 

ctaagcgaac 

ggagccgcag 

tccaggatac 

agcgcggaaa 



gaccgcgtag 

gacgaagaag 

caagcgctcc 

cgagacggtc 

gcgctcgaag 

cactaacatc 

ccggcggcgc 

catggtctcg 

catctccagg 

tcttatcaat 

atccgaaaac 

ggtttcttgt 

gttgaagtag 

ggcttgctgg 

gtccttgtag 

gtgcatgcgc 

gcgctcggcg 

gacgaagcgg 

gttgacggtc 

cgtgtcgaag 

cggcggcggc 

ctcgagcatg 

ggtggtggag 

gtagttcatg 

atacgggcaa 

gggttgggct 

ctaacgtggt 

ggaggcgggt 

gcggccgtcc 



ttgcagaggc 
tacatgatcc 
atggcctcgt 
aactcctcct 
gccccggggg 
tcttctactt 
acgggcagac 
gtgacggcgc 
tggccgccgg 
tggcccgtag 
cgctgaacga 
t'cttcgggta 
gcggtcctga 
atgcgcagac 
tagtcctgca 
gtgagcccga 
aggatggcct 
tggtaggctc 
tggtggccgg 
atgtagtcgt 
tggcggtaga 
aggcggtggt 
gcgcgcggga 
gtggccgcgg 
aaacgaaagc 
gcgcgtgtac 
actggcactc 
cgttttggcc 
gcgatggctc 



gctggtagag 

agcggcggag 

agaagtccac 

ccagaagacg 

gctcctcttc 

cctcctcagg 

ggtcgatgaa 

gcccgtcctc 

gggggtctcc 

ggactccgcg 

aggcttcgag 

tttggtcggg 

gacggcggat 

ggtcggccat 

tgagccgctc 

acccgcgctg 

gctggatctg 

cggtgttgat 

ggcgcacgag 

tgcaggtgcg 

gcggccatcg 

agccgtagat 

actcgcggac 

tctggcccgt 

ggtcagcggc 

cccggttcga 

ccgtctcgac 

attttcgtca 

gctgccgtag 



gtagttgagc 

cggcatctcg 

ggcgaagttg 

gatgagctcg 

ttccatctcc 

aggcggcggc 

gcgctcgatg 

gcggggccgc 

gttgggcagg 

caaggacctg 

ccagtcgcag 

aggcgggcgg 

ggtggcgagg 

gccccaggcg 

cacgggcacc 

cggctggacg 

ggtgagggtg 

ggtgtaggag 

ctcgtggtac 

cacgaggtac 

ctcggtggcg 

gtacctggac 

gqggttccag 

gaggcgcgcg 

tcgactccgt 

gtccctgctc 

ccaagcctgc 

ggccggaaat 

tctggagaaa 



9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
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gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740 

ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc 10800 

gacttctcca gttacggagc gagcccctct ttttcttgtg tttttgccag atgcatcccg 10860 

tactgcggca gatgcgcccc caccctccac cacaaccgcc cctaccgcag cagcagcaac 10920 

agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980 

tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040 

tgggggcgtc .gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg 11100 

cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160 

gcgcctcccg cttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc 11220 

tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280 

tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 11340 

aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400 

tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460 

cggcgcagct gtttctggtg gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520 

tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 11580 

gcatcgtggt gcaggagcgc gggctgccgc tgtccgagaa gctggcggcc atcaacttct 11640 

cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700 

tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760 

tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagcc 11820 

gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11880 

ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcgctgg cagcctagcc 11940 

gccgggcctt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000 

aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060 

gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120 

taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180 

tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240 

ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300 

ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360 
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cgtggcccgc tacaacagca 


ccaacgtgca gacgaacctg 


gaccgcatgg 


tgaccgacgt 


12420 


gcgcgaggcg gtgtcgcagc 

^vf* 777 77 7 W W 


gcgagcggtt ccaccgcgag 


tcgaacctgg 


gctccatggt 


12480 


ggcgctgaac gccttcctga 


gcacgcagcc cgccaacgtg 


ccccggggcc 


aggaggacta 


12540 


caccaacttc atcagcgcgc 


tgcggctgat ggtggccgag 

www w w w w w ww 


gtgccccaga 


gcgaggtgta 


12600 


ccagtcgggg ccggactact 

W 7 7 W Zf 7 7 


tcttccagac cagtcgccag 


ggcttgcaga 


ccgtgaacct 


12660 


gagccaggct ttcaagaact 


tgcagggact gtggggcgtg 


caggccccgg 


tcggggaccg 


12720 


cgcgacggtg tcgagcctgc 


tgacgccgaa ctcgcgcctg 


ctgctgctgc 


tggtggcgcc 


12780 


cttcacggac agcggcagcg 


tgagccgcga ctcgtacctg 


ggctacctgc 


ttaacctgta 


12840 


ccgcgaggcc atcgggcagg 

w w 77 w w w ww 


cgcacgtgga cgagcagacc 

W WWW WW w 


taccaggaga 


tcacccacgt 


12900 


gagccgcgcg ctgggccagg 


aggacccggg caacctggag 

WW WWW WW w 


gccaccctga 


acttcctgct 


12960 


gaccaaccgg tcgcagaaga 


tcccgcccca gtacgcgctg 


agcaccgagg 


aggagcgcat 


13020 


cctgcgctac gtgcagcaga 


gcgtggggct gttcctgatg 


caggaggggg 


ccacgcccag 


13080 


cgccgcgctc gacatgaccg 

www w w w 


cgcgcaacat ggagcccagc 


atgtacgccc 


gcaaccgccc 


13140 


gttcatcaat aagctgatgg 


actacttgca tcgggcggcc 


gccatgaact 


cggactactt 


13200 


taccaacgcc atcttgaacc 


cgcactggct cccgccgccc 


gggttctaca 


cgggcgagta 


13260 


cgacatgccc gaccccaacg 


acgggttcct gtgggacgac 


gtggacagca 


gcgtgttctc 


13320 


gccgcgcccc accaccacca 


ccgtgtggaa gaaagagggc 


ggggaccggc 


ggccgtcctc 


13380 


ggcgctgtcc ggtcgcgcgg 


gtgctgccgc ggcggtgccc 


gaggccgcca 


gccccttccc 


13440 


gagcctgccc ttttcgctga 


acagcgtgcg cagcagcgag 


ctgggtcggc 


tgacgcggcc 


13500 


gcgcctgctg ggcgaggagg 

ww w w ^ w w w w w w 


agtacctgaa cgactccttg 


cttcggcccg 


agcgcgagaa 


13560 


gaacttcccc aataacggga 


tagagagcct ggtggacaag 


atgagccgct 


ggaagacgta 


13620 


cgcgcacgag cacagggacg 

W W WW www w 


agccccgagc tagcagcagc 


accggcgcca 


cccgtagacg 


13680 


ccagcggcac gacaggcagc 


ggggtctggt gtgggacgat 


gaggattccg 


ccgacgacag 


13740 


cagcgtgttg gacttgggtg 


ggagtggtgg tggtaacccg 


ttcgctcacc 


tgcgcccccg 


13800 


tatcgggcgc ctgatgtaag 


aatctgaaaa aataaaagac 


ggtactcacc 


aaggccatgg 


13860 


cgaccagcgt gcgttcttct 


ctgttgtttg tagtagt atg atg agg cgc gtg tac 

Met Met Arg Arg Val Tyr 


13915 



1 5 

ccg gag ggt cct cct ccc teg tac gag age gtg atg cag cag gcg gtg 13963 
Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val 
10 15 20 
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gcg gcg gcg atg cag ccc ccg ctg gag gcg cct tac gtg ccc ccg egg 14011 
Ala Ala Ala Met Gin Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg 
25 30 35 

tac ctg gcg cct acg gag ggg egg aac age att cgt tac teg gag ctg 14059 
Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser lie Arg Tyr Ser Glu Leu 
40 45 50 

gca ccc ttg tac gat acc acc egg ttg tac ctg gtg gac aac aag teg 14107 
Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser 
55 60 65 70 

gcg gac ate gec teg ctg aac tac cag aac gac cac age aac ttc ctg 14155 
Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu 
75 80 85 

acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gec age acc 14203 
Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr 
90 95 100 

cag acc ate aac ttt gac gag cgc teg egg tgg ggc ggc cag ctg aaa 14251 
Gin Thr He Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys 
105 110 115 

acc ate atg cac acc aac atg ccc aac gtg aac gag ttc atg tac age 14299 
Thr He Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser 
120 125 130 

aac aag ttc aag gcg egg gtg atg gtc teg cgc aag acc ccc aac ggg 14347 
Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly 
135 140 145 150 

gtc aca gta aca gat ggt agt cag gac gag ctg acc tac gag tgg gtg 14395 
Val Thr Val Thr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val 
155 160 165 

gag ttt gag ctg ccc gag ggc aac ttc teg gtg acc atg acc ate gat 14443 
Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr He Asp 
170 175 180 

ctg atg aac aac gec ate ate gac aac tac ttg gcg gtg ggg egg cag 14491 
Leu Met Asn Asn Ala He He Asp Asn Tyr Leu Ala Val Gly Arg Gin 
185 190 195 

aac ggg gtg ctg gag age gac ate ggc gtg aag ttc gac acg cgc aac 14539 
Asn Gly Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn 
200 205 210 

ttc egg ctg ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg 14587 
Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val 
215 220 225 230 

tac acc aac gag gee ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc 14635 
Tyr Thr Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys 
235 240 245 
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ggc gtg gac ttc acc gag age cgc etc age aac ctg ctg ggc ate cgc 14683 
Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly lie Arg 
250 255 260 

aag egg cag ccc ttc cag gag ggc ttc cag ate ctg tac gag gac ctg 14731 
Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu 
265 270 275 

gag ggg ggc aac ate ccc gcg ctg ctg gac gtg gac gec tac gag aaa 14779 
Glu Gly Gly Asn He Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys 
280 285 290 

age aag gag gat age gec gee gcg gcg acc gca gec gtg gee acc gec 14827 
Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala 
295 300 305 310 

tct acc gag gtg egg ggc gat aat ttt get age gec gcg aca ctg gca 14875 
Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Thr Leu Ala 
315 320 325 

gcg gec gag gcg get gaa acc gaa agt aag ata gtg ate cag ccg gtg 14923 
Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys He Val He Gin Pro Val 
330 335 340 

gag aag gac age aag gag agg age tac aac gtg etc gcg gac aag aaa 14971 
Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys 
345 350 355 

aac acc gec tac cgc age tgg tac ctg gec tac aac tac ggc gac ccc 15019 
Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro 
360 365 370 

gag aag ggc gtg cgc tec tgg acg ctg etc acc acc teg gac gtc acc 15067 
Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr 
375 380 385 390 

tgc ggc gtg gag caa gtc tac tgg teg ctg ccc gac atg atg caa gac 15115 
Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp 
395 400 405 

ccg gtc acc ttc cgc tec acg cgt caa gtt age aac tac ccg gtg gtg 15163 
Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val 
410 415 420 

ggc gec gag etc ctg ccc gtc tac tec aag age ttc ttc aac gag cag 15211 
Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin 
425 430 435 

gee gtc tac teg cag cag ctg cgc gee ttc acc teg etc acg cac gtc 15259 
Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val 
440 " 445 450 

ttc aac cgc ttc ccc gag aac cag ate etc gtt cgc ccg ccc gcg ccc 15307 
Phe Asn Arg Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro 
455 460 465 470 
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acc att acc acc gtc agt gaa aac gtt cct get etc aca gat cac ggg 
Thr He Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly 
475 480 485 



15355 



acc ctg ccg ctg cgc age agt ate egg gga gtc cag cgc gtg acc gtc 15403 
Thr Leu Pro Leu Arg Ser Ser He Arg Gly Val Gin Arg Val Thr Val 
490 495 500 

act gac gec aga cgc cgc acc tgc ccc tac gtc tac aag gec ctg ggc 15451 
Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly 
505 ~ 510 515 



gta gtc gcg ccg cgc gtc etc teg age cgc acc ttc taa aaaatgtcca 
Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe 
520 525 530 



15500 



ttctcatctc geccagtaat aacaceggtt ggggcctgcg cgcgcccagc aagatgtacg 15560 

gaggegcteg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 15620 

cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc 15680 

aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc gcccgtctcc accgtggacg 15740 

ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc ccgcgccaag agccggcggc 15800 

ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 15860 

geagggecag gcgcacggga cgcagggcca tgetcaggge ggccagacgc gcggcctccg 15920 

gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 15980 

ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 16040 

tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgetgaett cgcgatgttg 16100 

atgtgtccca gcggcgagga ggatgtccaa gegcaaatte aaggaagaga tgctccaggt 16160 

catcgcgccf gagatctacg gcccggcggc ggtgaaggag gaaagaaagc cccgcaaact 16220 

gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg gaeggactgg tggagtttgt 16280 

gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaceggtget 16340 

gcgacccggc accacggtgg tcttcacgcc eggegagegt tccggctccg cctccaagcg 16400 

ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggccg aacgtctggg 16460 

egagtttget taeggcaage gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 16520 

cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc ageaggtget 16580 

gcctggtgcg gcgccgcgcc ggggcttcaa gegegaggge ggegaggate tgtacccgac 16640 

catgeagetg atggtgccca agegecagaa gctggaggac gtgctggagc acatgaaggt 16700 

ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 16760 
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gggcgtgcag 


accgtggaca 


tcaagatccc 


cacggagccc atggaaacgc 


agaccgagcc 


16820 


cgtgaagccc 


agcaccagca 


ccatggaggt 


gcagacggat ccctggatgc 


cggcaccggc 


16880 


ttccaccacc 


cgccgaagac 


gcaagtacgg 


cgcggccagc ctgctgatgc 


ccaactacgc 


16940 


gctgcatcct 


tccatcatcc 


ccacgccggg 


ctaccgcggc acgcgcttct 


accgcggcta 


17000 


caccagcagc 


cgccgccgca 


agaccaccac 


ccgccgccgc cgtcgtcgca 


cccgccgcag 


17060 


cagcaccgcg 


acttccgccg 


ccgccctggt 


gcggagagtg taccgcagcg 


ggcgcgagcc 


17120 


tctgaccctg 


ccgcgcgcgc 


gctaccaccc 


gagcatcgcc atttaactac 


cgcctcctac 


17180 


ttgcagatat 


ggccctcaca 


tgccgcctcc 


gcgtccccat tacgggctac 


cgaggaagaa 


17240 


agccgcgccg 


tagaaggctg 


acggggaacg 


ggctgcgtcg ccatcaccac 


cggcggcggc 


17300 


gcgccatcag 


caagcggttg 


gggggaggct 


tcctgcccgc gctgatgccc 


atcatcgccg 


17360 


cggcgatcgg 


ggcgatcccc 


ggcatagctt 


ccgtggcggt gcaggcctct 


cagcgccact 


17420 


gagacacagc 


ttggaaaatt 


tgtaataaaa 


aatggactga cgctcctggt 


cctgtgatgt 


17480 


gtgtttttag 


atggaagaca 


tcaatttttc 


gtccctggca ccgcgacacg 


gcacgcggcc 


17540 


gtttatgggc 


acctggagcg 


acatcggcaa 


cagccaactg aacgggggcg 


ccttcaattg 


17600 


gagcagtctc 


tggagcgggc 


ttaagaattt 


cgggtccacg ctcaaaacct 


atggcaacaa 


17660 


ggcgtggaac 


agcagcacag 


ggcaggcgct 


gagggaaaag ctgaaagagc 


agaacttcca 


17720 


gcagaaggtg 


gtcgatggcc 


tggcctcggg 


catcaacggg gtggtggacc 


tggccaacca 


17780 


ggccgtgcag 


aaacagatca 


acagccgcct 


ggacgcggtc ccgcccgcgg 


ggtccgtgga 


17840 


gatgccccag 


gtggaggagg 


agctgcctcc 


cctggacaag cgcggcgaca 


agcgaccgcg 


17900 


tcccgacgcg 


gaggagacgc 


tgctgacgca 


cacggacgag ccgcccccgt 


acgaggaggc 


17960 


ggtgaaactg 


ggtctgccca 


ccacgcggcc 


cgtggcgcct ctggccaccg 


gggtgctgaa 


18020 


acccagcagc 


agcagcagcc 


agcccgcgac 


cctggacttg cctccgcctg 


cttcccgccc 


18080 


ctccacagtg 


gctaagcccc 


tgccgccggt 


ggccgtcgcg tcgcgcgccc 


cccgaggccg 


18140 


cccccaggcg 


aactggcaga 


gcactctgaa 


cagcatcgtg ggtctgggag 


tgcagagtgt 


18200 


gaagcgccgc 


cgctgctatt 


aaaagacact 


gtagcgctta acttcjcttgt 


ctgtgtgtat 


18260 


atgtatgtcc 


gccgaccaga 


aggaggagga 


agaggcgcgt cgccgagttg 


caag atg 


18317 



Met 
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18557 



18605 



gcc acc cca teg atg ctg ccc cag tgg gcg tac atg cac ate gec gga 18365 
Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His He Ala Gly 
535 540 545 

cag gac get teg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc 18413 
Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg 
550 ~ 555 560 

gcc aca gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg 18461 
Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr 
565 ** 570 575 

gtg gcg ccc acg cac gat gtg acc acc gac cgc age cag egg ctg acg 18509 
Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr 
580 585 590 595 

ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac teg tac aaa 
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys 
600 605 610 

gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc 
Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala 
615 620 625 

age acc tac ttt gac ate cgc ggc gtg ctg gat egg ggc cct age ttc 18653 
Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser Phe 
630 635 640 

aaa ccc tac tec ggc acc get tac aac age ctg get ccc aag gga gcg 18701 
Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala 
645 650 655 

ccc aac act tgc cag tgg aca tat aaa get gat ggt gat act ggt aca 18749 
Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly Thr 
660 665 670 675 

gaa aaa acc tat aca tat gga aat gcg cct gtg caa ggc att agt att 18797 
Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly He Ser He 
680 685 690 

aca aaa gat ggt att caa ctt gga act gac act gat gat cag ccc att 
Thr Lys Asp Gly He Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro He 
695 700 705 

tat gca gat aaa act tat caa cca gag cct caa gtg ggt gat get gaa 
Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 
710 715 720 

tgg cat gac ate act ggt act gat gaa aaa tat gga ggc aga get etc 18941 
Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 
725 730 735 



18845 



18893 



aag cct gac acc aaa atg aag ccc tgc tat ggt tct ttt gcc aag cct 
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
740 " 745 750 755 



18989 
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acc aat aaa gaa gga ggt cag gca aat gtg aaa acc gaa aca ggc ggt 19037 
Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 
760 765 770 

acc aaa gaa tat gac att gac atg gca ttc ttc gat aat cga agt gca 19085 
Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 
775 780 785 

get gcg get ggc ctg gec cca gaa att gtt ttg tat act gag aat gtg 19133 
Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn Val 
790 795 800 

gat ctg gaa act cca gat act cat att gta tac aag gcg ggc aca gat 19181 
Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr Asp 
805 810 815 

gac age age tct tct ate aat ttg ggt cag cag tec atg ccc aac aga 19229 
Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 
820 825 830 835 

ccc aac tac att ggc ttt aga gac aac ttt ate ggg etc atg tac tac 19277 
Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr 
840 845 ~ 850 

aac age act ggc aac atg ggc gtg ctg get ggt cag gec tec cag ctg 19325 
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
855 860 865 

aat get gtg gtg gac ttg cag gac aga aac act gaa ctg tec tac cag 19373 
Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
870 875 880 

etc ttg ctt gac tct ctg ggc gac aga acc agg tat ttc agt atg tgg 19421 
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
885 890 895 

aat cag gcg gtg gac age tat gac ccc gat gtg cgc att att gaa aat 19469 
Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn 
900 905 910 915 

cac ggt gtg gag gat gaa etc cct aac tat tgc ttc ccc ctg gat get 19517 
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
920 925 930 

gtg ggt aga act gat act tac cag gga att aag gec aat ggt get gat 19565 
Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Ala Asp 
935 940 945 

caa acc acc tgg acc aaa gat gat act gtt aat gat get aat gaa ttg 19613 
Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 
950 955 960 

ggc aag ggc aat cct ttc gec atg gag ate aac ate cag gec aac ctg 19661 
Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn Leu 
965 970 975 



15 



WO 03/046124 



PCT/US02/33645 



tgg egg aac ttc etc tac gcg aac gtg gcg ctg tac ctg ccc gac tec 
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
980 985 990 995 

tac aag tac acg ccg gec aac ate acg ctg ccg acc aac acc aac 
Tyr Lys Tyr Thr Pro Ala Asn He Thr Leu Pro Thr Asn Thr Asn 
1000 1005 1010 

acc tac gat tac atg cac ggc cgc gtg gtg gcg ccc teg ctg gtg 
Thr Tyr Asp Tyr Met A^n Gly Arg Val Val Ala Pro Ser Leu Val 
1015 1020 1025 

gac gee tac ate aac ate ggg gcg cgc tgg teg ctg gac ccc atg 
Asp Ala Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met 
1030 1035 1040 

gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg cgc 
Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg 
1045 1050 1055 



19709 



19754 



19799 



19844 



19889 



tac cgc tec atg etc ctg ggc aac ggg cgc tac gtg ccc ttc cac 19934 
Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 
1060 1065 1070 

ate cag gtg ccc caa aag ttc ttc gec ate aag age etc ctg etc 19979 
He Gin Val Pro Gin Lys Phe Phe Ala He Lys Ser Leu Leu Leu 
1075 1080 1085 

ctg ccc ggg tec tac acc tac gag tgg aac ttc cgc aag gac gtc 20024 
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val 
1090 1095 HOO 

aac atg ate ctg cag age tec etc ggc aac gac ctg cgc acg gac 20069 
Asn Met He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp 
1105 1110 1115 

ggg gec tec ate gec ttc acc age ate aac etc tac gec acc ttc 20114 
Gly Ala Ser He Ala Phe Thr Ser He Asn Leu Tyr Ala Thr Phe 
1120 1125 H30 



ttc ccc atg gcg cac aac acc gee tec acg etc gag gee atg ctg 20159 
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 
1135 1140 H45 

cgc aac gac acc aac gac cag tec ttc aac gac tac etc teg gcg 20204 
Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala 
1150 1155 1160 

gec aac atg etc tac ccc ate ccg gee aac gee acc aac gtg ccc 20249 
Ala Asn Met Leu Tyr Pro He Pro Ala Asn Ala Thr Asn Val Pro 
1165 1170 1175 

ate tec ate ccc teg cgc aac tgg gee gee ttc cgc gga tgg tec 20294 
He Ser He Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser 
1180 1185 1190 
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ttc acg cgc etc aag acc cgc gag acg ccc teg etc ggc tec ggg 
Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly 
1195 1200 1205 

ttc gac ccc tac ttc gtc tac teg ggc tec ate ccc tac etc gac 
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr Leu Asp 
1210 1215 1220 



ggc acc ttc tac etc aac cac acc ttc aag aag gtc tec ate acc 
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser lie Thr 
1225 1230 1235 



ttc gac tec tec gtc age tgg ccc ggc aac gac cgc etc ctg acg 

Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
1240 1245 1250 

ccc aac gag ttc gaa ate aag cgc acc gtc gac gga gag ggg tac 

Pro Asn Glu Phe Glu lie Lys Arg Thr Val Asp Gly Glu Gly Tyr 
1255 1260 12 65 

aac gtg gee cag tgc aac atg acc aag gac tgg ttc ctg gtc cag 

Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin 
1270 1275 1280 



atg ctg gee cac tac aac ate ggc tac cag ggc ttc tac gtg ccc 
Met Leu Ala His Tyr Asn • lie Gly Tyr Gin Gly Phe Tyr Val Pro 
1285 1290 ' ~ 1295 



gag ggc tac aag gac cgc atg tac tec ttc ttc cgc aac ttc cag 
Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin 
1300 1305 1310 



ccc atg age cgc cag gtc gtg gac gag gtc aac tac aag gac tac 
Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr 
1315 1320 1325 

cag gec gtc acc ctg gec tac cag cac aac aac teg ggc ttc gtc 
Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe Val 
1330 1335 1340 



ggc tac etc gcg ccc acc atg cgc cag gga cag ccc tac ccc gec 
Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr Pro Ala 
1345 1350 " 1355 



aac tac ccc tac ccg etc ate ggc aag age gec gtc gee age gtc 
Asn Tyr Pro Tyr Pro Leu lie Gly Lys Ser Ala Val Ala Ser Val 
1360 * 1365 1370 



acc cag aaa aag ttc etc tgc gac egg gtc atg tgg cgc ate ccc 
Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg lie Pro 
1375 1380 * ' 1385 



ttc tec age aac ttc atg tec atg ggc gcg etc acc gac etc ggc 
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly 
1390 1395 1400 



20339 



20384 



20429 



20474 



20519 



20564 



20609 



20654 



-20699 



20744 



20789 



20834 



20879 



20924 
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cag aac atg etc tac gec aac tec gec cac gcg eta gac atg aat 20969 
Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn 
1405 1410 1415 

ttc gaa gtc gac ccc atg gat gag tec acc ctt etc tat gtt gtc 21014 
Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val 
1420 1425 1430 

ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc 21059 
Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg Gly 
1435 1440 1445 

gtc ate gag gec gtc tac ctg cgc acg ccc ttc teg gee ggc aac 21104 
Val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
1450 " 1455 1460 

gec acc acc taa gccccgctct tgcttcttgc aagatgaegg cctgtgcggg 21156 
Ala Thr Thr 



ctccggcgag 


caggagctca 


gggccatcct 


ccgcgacctg 


ggctgeggge 


cctgcttcct 


21216 


gggcaccttc 


gaeaageget 


tecegggatt 


catggccccg 


cacaagctgg 


cctgcgccat 


21276 


cgtcaacacg 


gccggccgcg 


agaceggggg 


cgagcactgg 


ctggccttcg 


cctggaaccc 


21336 


gcgctcccac 


acctgctacc 


tcttcgaccc 


ettegggtte 


teggacgage 


gectcaagea 


21396 


gatctaccag 


ttcgagtacg 


agggectget 


gcgccgcagc 


gccctggcca 


ccgaggaccg 


21456 


f-* t~t /-* /t t~ r* r* 


ft ft A 3 3 3 it ^ 


/■• 3 z± ft 3 f* 

C C C CCl (J cl C 


/-i<v+- rr <**■ 3 ft ft ft ^ 


ccycy tLugg 


r*i-»+" ft f* ft ft 


4. X -J 


gctcttctgc 


tgcatgttcc 


tgcacgcctt 


cgtgcactgg 


cccgaccgcc 


ccatggacaa 


21S76 


gaaccccacc 


atgaacttgc 


tgacgggggt 


gcccaacggc 


atgctccagt 


cgccccaggt 


21636 


ggaacccacc 


ctgcgccgca 


accaggaggc 


gctctaccgc 


ttcctcaacg 


cccactccgc 


21696 


ctactttcgc 


tcccaccgcg 


cgcgcatcga 


gaaggccacc 


gccttcgacc 


gcatgaatca 


21756 


agacatgtaa 


accgtgtgtg 


tatgtgaatg 


ctttattcat 


aataaacagc 


acatgtttat 


21816 


gccacctttt 


ctgaggctct 


gactttattt 


agaaatcgaa 


ggggttctgc 


cggctctcgg 


21876 


cgtgccccgc 


gggcagggat 


acgttgcgga 


actggtactt 


gggcagccac 


ttgaactegg 


21936 


ggatcagcag 


cttcggcacg 


gggaggtegg 


ggaacgagtc 


gctccacagc 


ttgcgcgtga 


21996 


gttgeaggge 


gcccagcagg 


tegggegegg 


agatcttgaa 


atcgcagttg 


ggacccgcgt 


22056 


tctgcgcgcg 


ggagttgcgg 


tacaeggggt 


tgcagcactg 


gaacaccatc 


agggcegggt 


22116 


gcttcacgct 


cgceageacc 


gtcgegtegg 


tgatgccctc 


cacgtccaga 


tcctcggcgt 


22176 


tggccatccc 


gaagggggtc 


atettgeagg 


tctgccgccc 


catgetggge 


acgcagccgg 


22236 


gcttgtggtt 


geaategcag 


tgcaggggga 


tcagcatcat 


ctgggcctgc 


teggagctea 


22296 
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tgcccgggta 


catggccttc 


atgaaagcct 


ccagctggcg 


gaaggcctgc 


tgcgccttgc 


22356 


cgccctcggt 


gaagaagacc 


ccgcaggact 


tgctagagaa 


ctggttggtg 


gcgcagccgg 


22416 


cgtcgtgcac 


gcagcagcgc 


gcgtcgttgt 


tggccagctg 


caccacgctg 


cgcccccagc 


22476 


ggttctgggt 


gatcttggcc 


cggtcggggt 


tctccttcag 


cgcgcgctgc 


ccgttctcgc 


22536 


tcgccacatc 


catctcgatc 


gtgtgctcct 


tctggatcat 


cacggtcccg 


tgcaggcatc 


22596 


gcagcttgcc 


ctcggcctcg 


gtgcacccgt 


gcagccacag 


cgcgcagccg 


gtgcactccc 


22656 


agttcttgtg 


ggcgatctgg 


gagtgcgagt 


gcacgaagcc 


ctgcaggaag 


cggcccatca 


22716 


tcgtggtcag 


ggtcttgttg 


ctggtgaagg 


tcagcgggat 


gccgcggtgc 


tcctcgttca 


22776 


catacaggtg 


gcagatgcgg 


cggtacacct 


cgccctgctc 


gggcatcagc 


tggaaggcgg 


22836 


acttcaggtc 


gctctccacg 


cggtaccggt 


ccatcagcag 


cgtcatgact 


tccatgccct 


22896 


tctcccaggc 


cgagacgatc 


ggcaggctca 


gggggttctt 


caccgccgtt 


gtcatcttag 


22956 


tcgccgccgc 


tgaggtcagg 


gggtcgttct 


cgtccagggt 


ctcaaacact 


cgcttgccgt 


23016 


ccttctcggt 


gatgcgcacg 


gggggaaagc 


tgaagcccac 


ggccgccagc 


tcctcctcgg 


23076 


cctgcctttc 


gtcctcgctg 


tcctggctga 


tgtcttgcaa 


aggcacatgc 


ttggtcttgc 


23136 


ggggtttctt 


tttgggcggc 


agaggcggcg 


gcggagacgt 


gctgggcgag 


cgcgagttct 


23196 


cgctcaccac 


gactatttct 


tcttcttggc 


cgtcgtccga 


gaccacgcgg 


cggtaggcat 


23256 


gcctcttctg 


gggcagaggc 


ggaggcgacg 


ggctctcgcg 


gttcggcggg 


cggctggcag 


23316 


agccccttcc 


gcgttcgggg 


gtgcgctcct 


ggcggcgctg 


ctctgactga 


cttcctccgc 


23376 


ggccggccat 


tgtgttctcc 


tagggagcaa 


caagcatgga 


gactcagcca 


tcgtcgccaa 


23436 


catcgccatc 


tgcccccgcc 


gccgccgacg 


agaaccagca 


gcagaatgaa 


agcttaaccg 


23496 


ccccgccgcc 


cagccccacc 


tccgacgccg 


ccgcggcccc 


agacatgcaa 


gagatggagg 


23556 


aatccatcga 


gattgacctg 


ggctacgtga 


cgcccgcgga 


gcacgaggag 


gagctggcag 


23616 


cgcgcttttc 


agccccggaa 


gagaaccacc 


aagagcagcc 


agagcaggaa 


gcagagagcg 


23676 


agcagcagca 


ggctgggctc 


gagcatggcg 


actacctgag 


cggggcagag 


gacgtgctca 


23736 


tcaagcatct 


ggcccgccaa 


tgcatcatcg 


tcaaggacgc 


gctgctcgac 


cgcgccgagg 


23796 


tgcccctcag 


cgtggcggag 


ctcagccgcg 


cctacgagcg 


caacctcttc 


tcgccgcgcg 


23856 


tgccccccaa 


gcgccagccc 


aacggcacct 


gcgagcccaa 


cccgcgcctc 


aacttctacc 


23916 


cggtcttcgc 


ggtgcccgag 


gccctggcca 


cctaccacct 


ctttttcaag 


aaccaaagga 


23976 
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tccccgtctc 


ctgccgcgcc 


aaccgcaccc 


gcgccgacgc 


cctgctcaac 


ctgggtcccg 


24036 


gcgcccgcct 


acctgatatc 


gcctccttgg 


aagaggttcc 


caagatcttc 


gagggtctgg 


24096 


gcagcgacga 


gactcgggcc 


gcgaacgctc 


tgcaaggaag 


cggagaggag 


catgagcacc 


24156 


acagcgccct 


ggtggagttg 


gaaggcgaca 


acgcgcgcct 


ggcggtgctc 


aagcgcacgg 


24216 


tcgagctgac 


ccacttcgcc 


tacccggcgc 


tcaacctgcc 


ccccaaggtc 


atgagcgccg 


24276 


tcatggacca 


ggtgctcatc 


aagcgcgcct 


cgcccctctc 


ggatgaggac 


atgcaggacc 


24336 


ccgagagctc 


ggacgagggc 


aagcccgtgg 


tcagcgacga 


gcagctggcg 


cgctggctgg 


24396 


gagcgagtag 


caccccccag 


agcttggaag 


agcggcgcaa 


gctcatgatg 


gccgtggtcc 


24456 


tggtgaccgt 


ggagctggag 


tgtctgcgcc 


gcttcttcgc 


cgacgcagag 


accctgcgca 


24516 


aggtcgagga 


gaacctgcac 


tacctcttca 


ggcacgggtt 


tgtgcgccag 


gcctgcaaga 


24576 


tctccaacgt 


ggagctgacc 


aacctggtct 


cctacatggg 


catcctgcac 


gagaaccgcc 


24636 


tggggcagaa 


cgtgctgcac 


accaccctgc 


gcggggaggc 


ccgccgcgac 


tacatccgcg 


24696 


actgcgtcta 


cctgtacctc 


tgccacacct 


ggcagacggg 


catgggcgtg 


tggcagcagt 


24756 


gcctggagga 


gcagaacctg 


aaagagctct 


gcaagctcct 


gcagaagaac 


ctgaaggccc 


24816 


tgtggaccgg 


gttcgacgag 


cgcaccaccg 


cctcggacct 


ggccgacctc 


atcttccccg 


24876 


agcgcctgcg 


gctgacgctg 


cgcaacggac 


tgcccgactt 


tatgagtcaa 


agcatgttgc 


24936 


aaaactttcg 


ctctttcatc 


ctcgaacgct 


ccgggatcct 


gcccgccacc 


tgctccgcgc 


24996 


tgccctcgga 


cttcgtgccg 


ctgaccttcc 


gcgagtgccc 


cccgccgctc 


tggagccact 


25056 


gctacctgct 


gcgcctggcc 


aactacctgg 


cctaccactc 


ggacgtgatc 


gaggacgtca 


25116 


gcggcgaggg 


tctgctcgag 


tgccactgcc 


gctgcaacct 


ctgcacgccg 


caccgctccc 


25176 


tggcctgcaa 


cccccagctg 


ctgagcgaga 


cccagatcat 


cggcaccttc 


gagttgcaag 


25236 


gccccggcga 


gggcaagggg 


ggtctgaaac 


tcaccccggg 


gctgtggacc 


tcggcctact 


25296 


tgcgcaagtt 


cgtgcccgag 


gactaccatc 


ccttcgagat 


caggttctac 


gaggaccaat 


25356 


cccagccgcc 


caaggccgaa 


ctgtcggcct 


gcgtcatcac 


ccagggggcc 


atcctggccc 


25416 


aattgcaagc 


catccagaaa 


tcccgccaag 


aatttctgct 


gaaaaagggc 


cacggggtct 


25476 


accbggaccc 


ccagaccgga 


cracrcracrctica 


accccagctt 


cccccaaaat 

W Vp* Vp* *A *H 


accccaaaaa 


25536 


agcagcaaga 


agctgaaagt 


ggagctgccg 


ccgccggagg 


atttggagga 


agactgggag 


25596 


agcagtcagg 


cagaggagga 


ggagatggaa 


gactgggaca 


gcactcaggc 


agaggaggac 


25656 


agcctgcaag 


acagtctgga 


agacgaggtg 


gaggaggagg 


cagaggaaga 


agcagccgcc 


25716 
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gccagaccgt cgtcctcggc ggagaaagca agcagcacgg ataccatctc cgctccgggt 25776 

cggggtcgcg gcgaccgggc ccacagtagg tgggacgaga ccgggcgctt cccgaacccc 25836 

accacccaga ccggtaagaa ggagcggcag ggatacaagt cctggcgggg gcacaaaaac 25896 

gccatcgtct cctgcttgca agcctgcggg ggcaacatct ccttcacccg ccgctacctg 25956 

ctcttccacc gcggggtgaa cttcccccgc aacatcttgc attactaccg tcacctccac 26016 

agcccctact actgtttcca agaagaggca gaaacccagc agcagcagaa aaccagcggc 26076 

agcagcagct agaaaatcca cagcggcggc aggtggactg aggatcgcag cgaacgagcc 26136 

ggcgcagacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 26196 

gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 26256 

cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 26316 

tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccacacacg 26376 

gaaaaaggcg ggaattacgt caccacctgc gcccttcgcc cgaccatcat catgagcaaa 26436 

gagattccca cgccttacat gtggagctac cagccccaga tgggcctggc cgccggcgcc 26496 

gcccaggact actccacccg catgaactgg ctcagcgccg ggcccgcgat gatctcacgg 26556 

gtgaatgaca tccgcgcccg ccgaaaccag atactcctag aacagtcagc gatcaccgcc 26616 

acgccccgcc atcaccttaa tccgcgtaat tggcccgccg ccctggtgta ccaggaaatt 26676 

ccccagccca cgaccgtact acttccgcga gacgcccagg ccgaagtcca gctgactaac 26736 

tcaggtgtcc agctggccgg cggcgccgcc ctgtgtcgtc accgccccgc tcagggtata 26796 

aagcggctgg tgatccgagg cagaggcaca cagctcaacg acgaggtggt gagctcttcg 26856 

ctgggtctgc gacctgacgg agtcttccaa ctcgccggat cggggagatc ttccttcacg 26916 

cctcgtcagg ccgtcctgac tttggagagt tcgtcctcgc agccccgctc gggtggcatc 26976 

ggcactctcc agttcgtgga ggagttcact ccctcggtct acttcaaccc cttctccggc 27036 

tcccccggcc actacccgga cgagttcatc ccgaacttcg acgccatcag cgagtcggtg 27096 

gacggctacg attgaatgtc ccatggtggc gcagctgacc tagctcggct tcgacacctg 27156 

gaccactgcc gccgcttccg ctgcttcgct cgggatctcg ccgagtttgc ctactttgag 27216 

ctgcccgagg agcaccctca gggcccggcc cacggagtgc ggatcatcgt cgaagggggc 27276 

ctcgactccc acctgcttcg gatcttcagc cagcgaccga tcctggtcga gcgcgagcaa 27336 

ggacagaccc ttctgaccct gtactgcatc tgcaaccacc ccggcctgca tgaaagtctt 27396 
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tgttgtctgc tgtgtactga gtataataaa agctgagatc agcgactact ccggactcga 27456 

ttgtggtgtt cctgctatca accggtccct gttcttcacc gggaacgaga ccgagctcca 27516 

gcttcagtgt aagccccaca agaagtacct cacctggctg ttccagggct ccccgatcgc 27576 

cgttgtcaac cactgcgaca acgacggagt cctgctgagc ggccccgcca accttacttt 27636 

ttccacccgc agaagcaagc tccagctctt ccaacccttc ctccccggga cctatcagtg 27696 

cgtctcggga ccctgccatc acaccttcca cctgatcccg aataccacag cgccgctccc 27756 

cgctactaac aaccaaacta cccaccatcg ccaccgtcgc gacctttctg aatctaacac 27816 

taccacccac accggaggtg agctccgagg tcgaccaacc tctgggattt actacggccc 27876 

ctgggaggtg gtggggttaa tagcgctagg cctagttgtg ggtgggcttt tggctctctg 27936 

ctacctatac ctcccttgct gttcgtactt agtggtgctg tgttgctggt ttaagaaatg 27996 

gggaagatca ccctagtgag ctgcggtgcg ctggtggcgg tggtggtgtt ttcgattgtg 28056 

ggactgggcg gcgcggctgt agtgaaggag aaggccgatc cctgcttgca tttcaatccc 28116 

gacaattgcc agctgagttt tcagcccgat ggcaatcggt gcgcggtgct gatcaagtgc 28176 

ggatgggaat gcgagaacgt gagaatcgag tacaataaca agactcggaa caatactctc 28236 

gcgtccgtgt ggcagcccgg ggaccccgag tggtacaccg tctctgtccc cggtgctgac 28296 

ggctccccgc gcaccgtgaa caatactttc atttttgcgc acatgtgcga cacggtcatg 28356 

tggatgagca agcagtacga tatgtggccc cccacgaagg agaacatcgt ggtcttctcc 28416 

atcgcttaca gcgcgtgcac ggcgctaatc accgctatcg tgtgcctgag cattcacatg 28476 

ctcatcgcta ttcgccccag aaataatgcc gaaaaagaga aacagccata acacgttttt 28536 

tcacacacct ttttcagacc atggcctctg ttaaattttt gcttttattt gccagtctca 28596 

ttactgttat aagtaatgag aaactcacta tttacattgg cactaaccac actttagacg 28656 

gaattccaaa atcctcatgg tattgctatt ttgatcaaga tccagactta actatagaac 28716 

tgtgtggtaa caagggaaaa aatacaagca ttcatttaat taactttaat tgcggagaca 28776 

atttgaaatt aattaatatc actaaagagt atggaggtat gtattactat gttgcagaaa 28836 

ataacaacat gcagttttat gaagttactg taactaatcc caccacacct agaacaacaa 28896 

caaccaccac cacaaaaact acacctgtta ccactatgca gctcactacc aataacattt 28956 

ttgccatgcg tcaaatggtc aacaatagca ctcaacccac cccacccagt gaggaaattc 29016 

ccaaatccat gattggcatt attgttgctg tagtggtgtg catgttgatc atcgccttgt 29076 

gcatggtgta ctatgccttc tgctacagaa agcacagact gaacgacaag ctggaacact 29136 
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tactaagtgt 


tgaattttaa 


ttttttagaa 


ccatgaagat 


cctaggcctt 


ttaatttttt 


29196 


ctatcattac 


ctctgctcta 


tgcaattctg 


acaatgagga 


cgttactgtc 


gttgtcggaa 


29256 


ccaattatac 


actgaaaggt 


ecagegaagg 


gtatgettte 


gtggtattgc 


tggtttggaa 


29316 


ctgacgagca 


acagacagag 


etctgeaatg 


ctcaaaaagg 


caaaacctca 


aattctaaaa 


29376 


tctctaatta 


teaatgeaat 


ggcactgact 


tagtactget 


caatgtcacg 


aaagcatatg 


29436 


ctggcagcta 


cacctgccct 


ggagatgata 


ctgagaacat 


gattttttac 


aaagtggaag 


29496 


tggttgatcc 


cactactcca 


cctccaccca 


ccacaactac 


tcacaccaca 


cacacagaac 


29556 


aaaccacagc 


agaggaggca 


gcaaagttag 


ecttgeaggt 


ccaagacagt 


tcatttgttg 


29616 


gcattacccc 


tacacctgat 


cagcggtgtc 


eggggctget 


cgtcagcggc 


attgtcggtg 


29676 


tgctttcggg 


attagcagtc 


ataatcatct 


gcatgttcat 


ttttgcttgc 


tgctatagaa 


29736 


ggctttaccg 


acaaaaatca 


gacccactgc 


tgaacctcta 


tgtttaattt 


tttccagagc 


29796 


catgaaggca 


gttagcactc 


tagttttttg 


ttctttgatt 


ggcactgttt 


ttagtgttag 


29856 


ctttttgaaa 


caaatcaatg 


ttactgaggg 


ggaaaatgtg 


acactggtag 


gcgtagaggg 


29916 


tgctcaaaat 


accacctgga 


caaaattcca 


tctagatggg 


tggaaagaaa 


tttgcacctg 


29976 


gaatgtcagt 


acttatacat 


gtgaaggagt 


taatcttacc 


attgtcaatg 


teagecaaat 


30036 


tcaaaagggt 


tggattaaag 


ggcaatctgt 


tagtgttagc 


aatagtgggt 


actataccca 


30096 


gcatactctt 


atctatgaca 


ttatagttat 


accactgcct 


acacctagcc 


cacctagcac 


30156 


taccacacag 


acaacccaca 


ctacacaaac 


aaccacatac 


agtacatcaa 


atcagcctac 


30216 


caeca ctaca 


acagcagagg 


ttgccagctc 


gtctggggtc 


cgagtggcat 


ttttgatgtt 


30276 


ggccccatct 


agcagtccca 


ctgetagtae 


caatgagcag 


actactgaat 


ttttgtccac 


30336 


tgtcgagagc 


cacaccacag 


ctacctcgag 


tgccttctct 


agcaccgcca 


atctatcctc 


30396 


gctttcctct 


acaccaatca 


gtcccgctac 


tactcctacc 


cccgctattc 


tccccactcc 


30456 


cctgaagcaa 


acagaeggeg 


acatgeaatg 


gcagatcacc 


ctgctcattg 


tgatcgggtt 


30516 


ggtcatcctg 


gccgtgttgc 


tctactacat 


cttctgccgc 


cgcattccca 


acgcgcaccg 


30576 


caagccggcc 


tacaagccca 


tcgttgtcgg 


geagceggag 


ccgcttcagg 


tggaaggggg 


30636 


tctaaggaat 


cttctcttct 


cttttacagt 


atggtgattg 


aattatgatt 


cctagacaaa 


30696 


tcttgatcac 


tattcttatc 


tgcctcctcc 


aagtctgtgc 


caccctcgct 


ctggtggcca 


30756 


acgccagtcc 


agactgtatt 


gggcccttcg 


cctcctacgt 


gctctttgcc 


ttcatcacct 


30816 
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gcatctgctg 


ctgtagcata 


gtctgcctgc ttatcacctt 


cttccagttc attgactgga 


30876 


tctttgtgcg 


catcgcctac 


ctgcgccacc acccccagta 


ccgcgaccag cgagtggcgc 


30936 


ggctgctcag 


gatcctctga 


taagcatgcg ggctctgeta 


cttctcgcgc ttctgctgtt 


30996 


agtgctcccc 


cgtcccgtcg 


acccccggac ccccacccag 


tcccccgagg aggtcegcaa 


31056 


atgcaaattc 


caagaaccct 


ggaaattcct caaatgetae 


cgccaaaaat cagacatgea 


31116 


tcccagctgg 


atcatgatca 


ttgggatcgt gaacattctg 


gcctgcaccc tcatctcctt 


31176 


tgtgatttac 


ccctgctttg 


actttggttg gaactcgcca 


gaggegctet atctcccgcc 


31236 


tgaacctgac 


acaccaccac 


agcaacctca ggcacacgca 


ctaccaccac caccacagcc 


31296 


taggccacaa 


tacatgccca 


tattagacta tgaggecgag 


ccacagcgac ccatgctccc 


31356 


cgctattagt 


tacttcaatc 


taaceggegg agatgactga 


cccactggcc aacaacaacg 


31416 


tcaacgacct 


tctcctggac 


atggacggcc gcgcctcgga 


gcagcgactc gcccaacttc 


31476 


gcattcgcca 


gcagcaggag 


agagcegtea aggagctgea 


ggaeggcata gccatccacc 


31536 


agtgcaagaa 


aggcatcttc 


tgcctggtga aacaggecaa 


gatctcctac gaggtcaccc 


31596 


agaccgacca 


tcgcctctcc 


tacgagctcc tgcagcagcg 


ccagaagttc acctgcctgg 


31656 


tcggagtcaa 


ccccatcgtc 


atcacccagc agtegggega 


taccaagggg tgcatccact 


31716 


gctcctgcga 


ctcccccgac 


tgcgtccaca ctctgatcaa 


gaccctctgc ggcctccgcg 


31776 


acctcctccc 


catgaactaa 


tcaccccctt atccagtgaa 


ataaagatca tattgatgat 


31836 


ttgagtttaa 


taaaaataaa 


gaatcactta cttgaaatct 


gataccaggt ctctgtccat 


31896 


gttttctgcc 


aacaccactt 


cactcccctc ttcccagctc 


tggtactgea ggccccggcg 


31956 


ggctgcaaac 


ttcctccaca 


ccctgaaggg gatgtcaaat 


tcctcctgtc cctcaatctt 


32016 


cattttatct 


tctatcag atg tec aaa aag cgc gtc egg gtg gat gat gac 
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 


32067 



1465 1470 

ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32112 

Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 
1475 1480 1485 

ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga ttc caa gag 32157 

Pro Phe He Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 
1490 1495 1500 

aag ccc ctg ggg gtg ctg tec ctg cgt ctg gee gat ccc gtc acc 32202 

Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 
1505 1510 1515 
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acc aag aac ggg gaa ate acc etc aag ctg gga gat ggg gtg gac 32247 
Thr Lys Asn Gly Glu lie Thr Leu Lys Leu Gly Asp Gly Val Asp 
1520 - 1525 1530 

etc gac tec teg gga aaa etc ate tec aac acg gec acc aag gec 32292 
Leu Asp Ser Ser Gly Lys . Leu lie Ser Asn Thr Ala Thr Lys Ala 
1535 1540 1545 

gec gec cct etc agt ttt tec aac aac acc att tec ctt aac atg 32337 
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr lie Ser Leu Asn Met 
1550 1555 1560 

gat acc cct ttt tac aac aac aat gga aag tta ggc atg aaa gtc 32362 
Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu Gly Met Lys Val 
1565 1570 1575 

act get cca ctg aag ata eta gac aca gac ttg eta aaa aca ctt 32427 
Thr Ala Pro Leu Lys lie Leu Asp Thr Asp Leu Leu Lys Thr Leu 
1580 1585 1590 

gtt gta get tat gga caa ggt tta gga aca aac acc act ggt gec 32472 
Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr Gly Ala 
1595 1600 1605 

ctt gtt gee caa eta gca tec cca ctt get ttt gat age aat age 32517 
Leu Val Ala Gin Leu Ala ' Ser Pro Leu Ala Phe Asp Ser Asn Ser 
1610 1615 1620 

aaa att gee ctt aat tta ggc aat gga cca ttg aaa gtg gat gca 32562 
Lys lie Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala 
1625 1630 1635 

aat aga ctg aac ate aat tgc aat aga gga etc tat gtt act acc 32607 
Asn Arg Leu Asn lie Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr 
1640 1645 ~ 1650 

aca aaa gat gca ctg gaa gec aat ata agt tgg get aat get atg 32652 
Thr Lys Asp Ala Leu Glu Ala Asn lie Ser Trp Ala Asn Ala Met 
1655 1660 1665 

aca ttt ata gga aat gee atg ggt gtc aat att gat aca caa aaa 32697 
Thr Phe lie Gly Asn Ala Met Gly Val Asn lie Asp Thr Gin Lys 
1670 1675 1680 

ggc ttg caa ttt ggc acc . act agt acc gtc gca gat gtt aaa aac 32742 
Gly Leu Gin Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn 
1685 1690 1695 

get tac ccc ata caa ate aaa ctt gga get ggt etc aca ttt gac 32787 
Ala Tyr Pro He Gin He Lys Leu Gly Ala Gly Leu Thr Phe Asp 
1700 1705 1710 

age aca ggt gca att gtt gca tgg aac aaa gat gat gac aag ctt 32832 
Ser Thr Gly Ala He Val Ala Trp Asn Lys Asp Asp Asp Lys Leu 
1715 1720 1725 
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aca eta tgg acc aca gec gac ccc tct cca aat tgt cac ata tat 32B77 
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His lie Tyr 
1730 1735 1740 

tct gaa aag gat get aag ctt aca ctt tgc ttg aca aag tgt ggc 32922 
Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly 
1745 " 1750 1755 

agt cag att ctg ggc act gtt tec etc ata get gtt gat act ggc 32967 
Ser Gin lie Leu Gly Thr Val Ser Leu lie Ala Val Asp Thr Gly 
1760 1765 1770 

agt tta aat ccc ata aca gga aca gta acc act get ctt gtc tea 33012 
Ser Leu Asn Pro lie Thr Gly Thr Val Thr Thr Ala Leu Val Ser 
1775 1780 1785 

ctt aaa ttc gat gca aat " gga gtt ttg caa age age tea aca eta 33057 
Leu Lys Phe Asp Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu 
1790 1795 1800 

gac tea gac tat tgg aat ttc aga cag gga gat gtt aca cct get 33102 
Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp Val Thr Pro Ala 
1805 1810 1815 

gaa gee tat act aat get ata ggt ttc atg ccc aat eta aaa gca 33147 
Glu Ala Tyr Thr Asn Ala. lie Gly Phe Met Pro Asn Leu Lys Ala 
1820 1825 1830 

tac cct aaa aac aca agt gga get gca aaa agt cac att gtt ggg 33192 
Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His lie Val Gly 
1835 " 1840 1845 

aaa gtg tac eta cat ggg gat aca ggc aaa cca ctg gac etc att 33237 
Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu lie 
1850 1855 1860 

att act ttc aat gaa aca agt gat gaa tct tgc act tac tgt att 33282 
He Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys He 
1865 1870 1875 

aac ttt caa tgg cag tgg ggg get gat caa tat aaa aat gaa aca 33327 
Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr 
1880 1885 1890 

ctt gee gtc agt tea ttc acc ttt tec tat att get aaa gaa taa 33372 
Leu Ala Val Ser Ser Phe' Thr Phe Ser Tyr lie Ala Lys Glu 
1895 1900 1905 

accccactct gtaccccatc tctgtctatg gaaaaaactc tgaaacacaa aataaaataa 33432 
agttcaagtg ttttattgat tcaacagttt tacaggattc gagcagttat ttttcctcca 33492 
ccctcccagg acatggaata caccaccctc tccccccgca cagecttgaa catctgaatg 33552 
ccattggtga tggacatget tttggtctcc acgttccaca cagtttcaga gcgagccagt 33612 
ctegggtegg tcagggagat gaaaccctcc gggcactccc gcatctgcac ctcacagctc 33672 
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aacagctgag 


gattgtcctc 


ggtggtcggg 


atcacggtta 


tctggaagaa gcagaagagc 


33732 


ggcggtggga 


atcatagtcc 


gcgaacggga 


tcggccggtg 


gtgtcgcatc aggccccgca 


33792 


gcagtcgctg 


tcgccgccgc 


tccgtcaagc 


tgctgctcag 


ggggtccggg tccagggact 


33852 


ccctcagcat 


gatgcccacg 


gccctcagca 


tcagtcgtct 


ggtgcggcgg gcgcagcagc 


33912 


gcatgcggat 


ctcgctcagg 


tcgctgcagt 


acgtgcaaca 


caggaccacc aggttgttca 


33972 


acagtccata 


gttcaacacg 


ctccagccga 


aactcatcgc 


gggaaggatg ctacccacgt 


34032 


ggccgtcgta 


ccagatcctc 


aggtaaatca 


agtggcgccc 


cctccagaac acgctgccca 


34092 


tgtacatgat 


ctccttgggc 


atgtggcggt 


tcaccacctc 


ccggtaccac atcaccctct 


34152 


ggttgaacat 


gcagccccgg 


atgatcctgc 


ggaaccacag 


ggccagcacc gccccgcccg 


34212 


ccatgcagcg 


aagagacccc 


gggtcccgac 


aatggcaatg 


gaggacccac cgctcgtacc 


34272 


cgtggatcat 


ctgggagctg 


aacaagtcta 


tgttggcaca 


gcacaggcat atgctcatgc 


34332 


atctcttcag 


cactctcagc 


tcctcggggg 


tcaaaaccat 


atcccagggc acggggaact 


34392 


cttgcaggac 


agcgaacccc 


gcagaacagg 


gcaatcctcg 


cacataactt acattgtgca 


34452 


tggacagggt 


atcgcaatca 


ggcagcaccg 


ggtgatcctc 


caccagagaa gcgcgggtct 


34512 


cggtctcctc 


acagcgtggt 


aagggggccg 


gccgatacgg 


gtgatggcgg gacgcggctg 


34572 


atcgtgttcg 


cgaccgtgtt 


atgatgcagt 


tgctttcgga 


cattttcgta cttgctgtag 


34632 


cagaacctgg 


tccgggcgct 


gcacaccgat 


cgccggcggc 


ggtcccggcg cttggaacgc 


34692 


tcggtgttga 


agttgtaaaa 


cagccactct 


ctcagaccgt 


gcagcagatc tagggcctca 


34752 


ggagtgatga 


agatcccatc 


atgcctgatg 


gctctaatca 


catcgaccac cgtggaatgg 


34812 


gccagaccca 


gccagatgat 


gcaattttgt 


tgggtttcgg 


tgacggcggg ggagggaaga 


34872 


acaggaagaa 


ccatgattaa 


cttttaatcc 


aaacggtctc 


ggagcacttc aaaatgaaga 


34932 


tcgcggagat 


ggcacctctc 


gcccccgctg 


tgttggtgga 


aaataacagc caggtcaaag 


34992 


gtgatacggt 


tctcgagatg 


ttccacggtg 


gcttccagca 


aagcctccac gcgcacatcc 


35052 


agaaacaaga 


caatagcgaa 


agcgggaggg 


ttctctaatt 


cctcaatcat catgttacac 


35112 


tcctgcacca 


tccccagata 


attttcattt 


ttccagcctt 


gaatgattcg aactagttcc 


35172 


tgaggtaaat 


ccaagccagc 


catgataaag 


agctcgcgca 


gagcgccctc caccggcatt 


35232 


cttaagcaca 


ccctcataat 


tccaagatat 


tctgctcctg 


gttcacctgc agcagattga 


35292 


caagcggaat 


atcaaaatct 


ctgecgcgat 


ccctaagctc 


ctccctcagc aataactgta 


35352 
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agtactcttt 


catatcctct 


ccgaaatttt 


tagccatagg accaccagga ataagattag 


35412 


ggcaagccac 


agtacagata 


aaccgaagtc 


ctccccagtg agcattgcca aatgcaagac 


35472 


tgctataagc 


atgctggcta 


gacccggtga 


tatcttccag ataactggac agaaaatcgc 


35532 


ccaggcaatt 


tttaagaaaa 


tcaacaaaag 


aaaaatcctc caggtgcacg tttagagcct 


35592 


cgggaacaac 


gatggagtaa 


atgcaagcgg 


tgcgttccag catggttagt tagctgatct 


35652 


gtagaaaaaa 


acaaaaatga 


acattaaacc 


atgctagcct ggcgaacagg tgggtaaatc 


35712 


gttctctcca 


gcaccaggca 


ggccacgggg 


tctccggcac gaccctcgta aaaattgtcg 


35772 


ctatgattga 


aaaccatcac 


agagagacgt 


tcccggtggc cggcgtgaat gattcgacaa 


35832 


gatgaataca 


cccccggaac 


attggcgtcc 


gcgagtgaaa aaaagcgccc aaggaagcaa 


35892 


taaggcacta 


caatgctcag 


tctcaagtcc 


agcaaagcga tgccatgcgg atgaagcaca 


35952 


aaattctcag 


gtgcgtacaa 


aatgtaatta 


ctcccctcct gcacaggcag caaagccccc 


36012 


gatccctcca 


ggtacacata 


caaagcctca 


gcgtccatag cttaccgagc agcagcacac 


36072 


aacaggcgca 


agagtcagag 


aaaggctgag 


ctctaacctg tccacccgct ctctgctcaa 


36132 


tatatagccc 


agatctacac 


tgacgtaaag 


gccaaagtct aaaaataccc gccaaataat 


36192 


cacacacgcc 


cagcacacgc 


ccagaaaccg 


gtgacacact caaaaaaata cgcgcacttc 


36252 


ctcaaacgcc 


caaactgccg 


tcatttccgg 


gttcccacgc tacgtcatca aaattcgact 


36312 


ttcaaattcc 


gtcgaccgtt 


aaaaacgtcg 


cccgccccgc ccctaacggt cgccgctccc 


36372 


gcagccaatc 


accgccccgc 


atccccaaat 


tcaaatacct catttgcata ttaacgcgca 


36432 


ccaaaagttt 


gaggtatatt 


attgatgatg 




36462 



<210> 2 

<211> 530 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 

<400> 2 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
1 5 10 15 

Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
50 55 60 
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Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn 
65 70 75 80 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg 
100 105 110 

Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn Val 
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 135 140 

Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gin Asp Glu 
145 150 155 " 160 

Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser 
165 170 175 

Val Thr Met Thr lie Asp Leu Met Asn Asn Ala He He Asp Asn Tyr 
180 185 190 

Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly Val 
195 200 205 

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu 
210 215 220 

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He 
225 230 235 240 

Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser 
245 250 255 

Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Gin 
260 265 270 

He Leu Tyr Glu Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp 
275 280 285 

Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr 
290 295 300 

Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala 
305 310 315 320 

Ser Ala Ala Thr Leu Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys 
325 330 335 



He Val He Gin Pro Val Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn 
340 345 350 
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Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala 
355 360 " 365 

Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu 
370 375 " 380 

Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu 
385 390 395 400 

Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val 
405 410 415 

Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys 
420 425 430 

Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe 
435 440 445 

Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu 
450 455 460 

Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro 
465 470 475 480 

Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Arg Gly 
485 490 495 

Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 
500 505 510 

Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg 
515 520 525 



Thr Phe 
530 



<210> 3 

<211> 933 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 

<400> 3 



Met Ala Thr Pro Ser Met Leu Pro 
1 5 

Gly Gin Asp Ala Ser Glu Tyr Leu 
20 

Arg Ala Thr Asp Thr Tyr Phe Ser 
35 40 

Thr Val Ala Pro Thr His Asp Val 
50 55 



Gin Trp Ala Tyr Met His He Ala 
10 15 

Ser Pro Gly Leu Val Gin Phe Ala 
25 30 

Leu Gly Asn Lys Phe Arg Asn Pro 
45 

Thr Thr Asp Arg Ser Gin Arg Leu 
60 
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Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 * 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 ~ 120 125 

Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly 
130 135 140 

Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly lie Ser 
145 150 155 160 

He Thr Lys Asp Gly He Gin Leu Gly Thr Asp Thr Asp Asp Gin Pro 
165 170 * * 175 

He Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala 
180 185 190 

Glu Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala 
195 200 205 

Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys 
210 215 ~ 220 

Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly 
225 230 ' 235 240 

Gly Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser 
245 250 255 

Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn 
260 265 270 

Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys Ala Gly Thr 
275 280 285 

Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ser Met Pro Asn 
290 295 300 

Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr 
305 310 315 320 

Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin 
325 330 335 

Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr 
340 345 350 

Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met 
355 360 365 
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Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie lie Glu 

370 375 " " 380 

Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp 
385 390 395 400 

Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Ala 
405 410 415 

Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu 
420 & 425 " 430 

Leu Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn 
435 440 445 

Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp 
450 455 460 

Ser Tyr Lys Tyr Thr Pro Ala Asn He Thr Leu Pro Thr Asn Thr Asn 
465 470 475 480 

Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp 
485 490 495 

Ala Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn 
500 505 * 510 

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser 
515 520 525 

Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro 
530 535 540 

Gin Lys Phe Phe Ala lie Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr 
545 ■ 550 555 560 

Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser 
565 570 575 

Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala Phe Thr 
580 585 590 

Ser He Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala 
595 600 605 

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe 
610 615 620 

Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro He Pro Ala Asn 
625 630 635 640 

Ala Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala Phe 
645 650 ^ * 655 

Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu 
660 665 670 
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Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser lie Pro Tyr 
675 680 685 

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He 
690 695 700 

Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
705 710 715 720 

Pro Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn 
725 730 735 

Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu 
740 745 750 

Ala His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr 
755 760 765 

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg 
770 775 780 

Gin Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu 
785 790 795 800 

Ala Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr 
805 810 815 

Met Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He 
820 825 830 

Gly Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp 
835 840 845 

Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
850 855 860 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
865 870 875 880 

Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu 
885 890 895 

Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 
900 905 ^ 910 

His Arg Gly Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
915 920 925 

Gly Asn Ala Thr Thr 
930 



<210> 4 

<211> 445 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 
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<400> 4 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
15 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe lie Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu lie Thr Leu 
50 55 60 

Lys Leu Gly Asp Gly Val Asp Leu Asp Ser Ser Gly Lys Leu lie Ser 
65 ~ 70 75 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

lie Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu 
100 105 110 

Gly Met Lys Val Thr Ala Pro Leu Lys lie Leu Asp Thr Asp Leu Leu 
115 120 125 

Lys Thr Leu Val Val Ala Tyr Gly Gin Gly Leu Gly Thr Asn Thr Thr 
130 135 140 

Gly Ala Leu Val Ala Gin Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn 
145 150 155 160 

Ser Lys lie Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala 
165 170 175 

Asn Arg Leu Asn lie Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr Thr 
180 185 190 

Lys Asp Ala Leu Glu Ala Asn lie Ser Trp Ala Asn Ala Met Thr Phe 
195 200 205 

He Gly Asn Ala Met Gly Val Asn He Asp Thr Gin Lys Gly Leu Gin 
210 215 220 

Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn Ala Tyr Pro He 
225 230 . 235 240 

Gin He Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala He 
245 250 255 

Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala 
260 265 270 

Asp Pro Ser Pro Asn Cys His He Tyr Ser Glu Lys Asp Ala Lys Leu 
275 280 285 
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Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin lie Leu Gly Thr Val Ser 
290 295 300 

Leu He Ala Val Asp Thr Gly Ser Leu Asn Pro He Thr Gly Thr Val 
305 310 315 320 

Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gin 
325 330 335 

Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp Asn Phe Arg Gin Gly Asp 
340 345 350 

Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala He Gly Phe Met Pro Asn 
355 360 365 

Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His He 
370 375 380 

Val Gly Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu 
385 390 395 400 

He He Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys He 
405 410 " 415 

Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin Tyr Lys Asn Glu Thr Leu 
420 425 430 

Ala Val Ser Ser Phe Thr Phe Ser Tyr He Ma Lys Glu 
435 440 445 



<210> 5 

<211> 36604 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan6 
<220> 

<221> CDS 

<222> (13878) (15467) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18284) . ♦ (21112) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (32162) (33493) 

<223> L5 Fiber 

<400> 5 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120 
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tgacgttttg 


atgacgtggc 


tatgaggcgg 


agccggtttg 


caagttctcg 


tgggaaaagt 


180 


gacgtcaaac 


gaggtgtggt 


ttgaacacgg 


aaatactcaa 


ttttcccgcg 


ctctctgaca 


240 


ggaaatgagg 


tgtttctggg 


cggatgcaag 


tgaaaacggg 


ccattttcgc 


gcgaaaactg 


300 


aatgaggaag 


tgaaaatctg 


agtaatttcg 


cgtttatggc 


agggaggagt 


atttgccgag 


360 


ggccgagtag 


actttgaccg 


attacgtggg 


ggtttcgatt 


accgtatttt 


tcacctaaat 


420 


ttccgcgtac 


ggtgtcaaag 


tccggtgttt 


ttacgtaggc 


gtcagctgat 


cgccagggta 


480 


tttaaacctg 


cgctctctag 


tcaagaggcc 


actcttgagt 


gccagcgagt 


agagttttct 


540 


cctccgcgcc 


gcgagtcaga 


tctacacttt 


gaaagatgag 


gcacotgaga 


gacctgcccg 


600 


gtaatgtttt 


cctggctact 


gggaacgaga 


ttctggaatt 


ggtggtggac 


gccatgatgg 


660 


gtgacgaccc 


tccagagccc 


cctaccccat 


ttgaggcgcc 


ttcgctgtac 


gatttgtatg 


720 


atctggaggt 


ggatgtgccc 


gagagcgacc 


ctaacgagga 


ggcggtgaat 


gatttgttta 


780 


gcgatgccgc 


gctgctggct 


gccgagcagg 


ctaatacgga 


ctctggctca 


gacagcgatt 


840 


cctctctcca 


taccccgaga 


cccggcagag 


gtgagaaaaa 


gatccccgag 


cttaaagggg 


900 


aagagctcga 


cctgcgctgc 


tatgaggaat 


gcttgcctcc 


gagcgatgat 


gaggaggacg 


960 


aggaggcgat 


tcgagctgcg 


gtgaaccagg 


gagtgaaaac 


tgcgggcgag 


agctttagcc 


1020 


tggactgtcc 


tactctgccc 


ggacacggct 


gtaagtcttg 


tgaatttcat 


cgcatgaata 


1080 


ctggagataa 


gaatgtgatg 


tgtgccctgt 


gctatatgag 


agcttacaac 


cattgtgttt 


1140 


acagtaagtg 


tgattaactt 


tagttgggaa 


ggcagagggt 


gactgggtgc 


tgactggttt 


1200 


atttatgtat 


atgttttttt 


atgtgtaggt 


cccgtctctg 


acgtagatga 


gacccccact 


1260 


tcagagtgca 


tttcatcacc 


cccagaaatt 


ggcgaggaac 


cgcccgaaga 


tattattcat 


1320 


agaccagttg 


cagtgagagt 


caccgggcgg 


agagcagctg 


tggagagttt 


ggatgacttg 


1380 


ctacagggtg 


gggatgaacc 


tttggacttg 


tgtacccgga 


aacgccccag 


gcactaagtg 


1440 


ccacacatgt 


gtgtttactt 


aaggtgatgt 


cagtatttat 


agggtgtgga 


gtgcaataaa 


1500 


atccgtgttg 


actttaagtg 


cgtgttttat 


gactcagggg 


tggggactgt 


gggtatataa 


1560 


gcaggtgcag 


acctgtgtgg 


tcagttcaga 


gcaggactca 


tggagatctg 


gactgtcttg 


1620 


gaagactttc 


accagactag 


acagttgcta 


gagaactcat 


cggagggagt 


ctcttacctg 


1680 


tggagattct 


gcttcggtgg 


gcctctagct 


aagctagtct 


atagggccaa 


acaggattat 


1740 


aaggaacaat 


ttgaggatat 


tttgagagag 


tgtcctggta 


tttttgactc 


tctcaacttg 


1800 


ggccatcagt 


ctcactttaa 


ccagagtatt 


ctgagagccc 


ttgacttttc 


tactcctggc 


1860 
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agaactaccg 


ccgcggtagc 


cttttttgcc 


tttattcttg 


acaaatggag 


tcaagaaacc 


1920 


catttcagca 


gggattaccg 


tctggactgc 


ttagcagtag 


ctttgtggag 


aacatggagg 


1980 


tgccagcgcc 


tgaatgcaat 


ctpcggctac 


ttgccagtac 


agccggtaga 


cacgctgagg 


2040 


atcctgagtc 


tccagtcacc 


ccaggaacac 


caacgccgcc 


agcagccgca 


gcaggagcag 


2100 


cagcaagagg 


aggaccgaga 


agagaacccg 


agagccggtc 


tggaccctcc 


ggtggcggag 


2160 


gaggaggagt 


agctgacttg 


tttcccgagc 


tgcgccgggt 


gctgactagg 


tcttccagtg 


2220 


gacgggagag 


ggggattaag 


cgggagaggc 


atgaggagac 


tagccacaga 


actgaactga 


2280 


ctgtcagtct 


gatgagccgc 


aggcgcccag 


aatcggtgtg 


gtggcatgag 


gtgcagtcgc 


2340 


aggggataga 


tgaggtctcg 


gtgatgcatg 


agaaatattc 


cctagaacaa 


gtcaagactt 


2400 


gttggttgga 


gcccgaggat 


gattgggagg 


tagccatcag 


gaattatgcc 


aagctggctc 


2460 


tgaagccaga 


caagaagtac 


aagattacca 


aactgattaa 


tatcagaaat 


tcctgctaca 


2520 


tttcagggaa 


tggggccgag 


gtggagatca 


gtacccagga 


gagggtggcc 


ttcagatgtt 


2580 


gtatgatgaa 


tatgtacccg 


ggggtggtgg 


gcatggaggg 


agtcaccttt 


atgaacacga 


2640 


ggttcagggg 


tgatgggtat 


aatggggtgg 


tctttatggc 


caacaccaag 


ctgacagtgc 


2700 


acggatgctc 


cttctttggc 


ttcaataaca 


tgtgcatcga 


ggcctggggc 


agtgtttcag 


2760 


tgaggggatg 


cagcttttca 


gccaactgga 


tgggggtcgt 


gggcagaacc 


aagagcaagg 


2820 


tgtcagtgaa 


gaaatgcctg 


ttcgagaggt 


gccacctggg 


ggtgatgagc 


gagggcgaag 


2880 


ccaaagtcaa 


acactgcgcc 


tctaccgaga 


cgggctgctt 


tgtgctgatc 


aagggcaatg 


2940 


cccaagtcaa 


gcataacatg 


atctgtgggg 


cctcggatga 


gcgcggctac 


cagatgctga 


3000 


cctgcgccgg 


tgggaacagc 


catatgctgg 


ccaccgtgca 


tgtggcctcg 


cacccccgca 


3060 


agacatggcc 


cgagttcgag 


cacaacgtca 


tgacccgctg 


caatgtgcac 


ctgggctccc 


3120 


gccgaggcat 


gttcatgccc 


taccagtgca 


acatgcaatt 


tgtgaaggtg 


ctgctggagc 


3180 


ccgatgccat 


gtccagagtg 


agcctgacgg 


gggtgtttga 


catgaatgtg 


gagctgtgga 


3240 


aaattctgag 


atatgatgaa 


tccaagacca 


ggtgccgggc 


ctgcgaatgc 


ggaggcaagc 


3300 


acgccaggct 


tcagcccgtg 


tgtgtggagg 


tgacggagga 


cctgcgaccc 


gatcatttgg 


3360 


tgttgtcctg 


caacgggacg 


gagttcggct 


ccagcgggga 


agaatctgac 


tagagtgagt 


3420 


agtgtttggg 


gctgggtgtg 


agcctgcatg 


aggggcagaa 


tgactaaaat 


ctgtggtttt 


3480 


ctgtgtgttg 


cagcagcatg 


agcggaagcg 


cctcctttga 


gggaggggta 


ttcagccctt 


3540 
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atctgacggg 


gcgtctcccc 


tcctgggcgg 


gagtgcgtca 


gaatgtgatg 

3 333 


ggatccacqq 

33 33 


3600 


tggacggccg 


gcccgtgcag 


cccgcgaact 


cttcaaccct 


gacctacgcg 


accctgagct 


3660 


cctcgtccgt 


ggacgcagct 

3 3 3 3 


gccgccgcag 
3 3 3 3 


ctgctgcttc 


cgccgccagc 


qccqtiqcqcq 
3 3 33 3 


3720 


gaatggccct 


gggcgccggc 

3 3 3 3 ^^"3 3 » 


tactacagct 


ctctggtggc 
33 ^33^ 


caactcgagt 


tccaccaata 


3780 


atcccgccag 


cctgaacgag 


qaqaaqctqc 


tqctqctqat 


ggcccagctc 


aaagccctga 


3840 


cccagcgcct 


gqgcgaqctg 

3 3? 333 


acccagcagg 


tggctcagct 


qcaqqcqqaq 


acaccrgqcccr 


3900 


cggttgccac 


ggtgaaaacc 


aaataaaaaa 


tgaatcaata 


aataaacgga 


gacggttgtt 


3960 


gattttaaca 


cagagtcttg 


aatctttatt 


tgatttttcg 


cgcgcqgtag 

3 ^3 33 WW »^JJ 


gccctggacc 


4020 


accggtctcg 


atcattgagc 


acccggtgga 
3? 33 


tcttttccag 


qacccqqtaq 
3 3 3 3 


agqtqqqctt 
33 3 3 9 ^ ^ ^ 


4080 


ggatgttgag 

3 3 9 3 9 


gtacatqggc 

3 333 


atgagcccgt 


cccgqqgqtq 

^^^33939 "3 


gaggtagctc 


cattacaaacf 
3 333 


4140 


cctcgtqctc 

3 3 


ggggatggtg 


ttgtaaatca 


cccagtcata 


gcaggggcgc 

3 "3333 w 3 


aqqgcgtqqt 

3 3 3^3 33 


4200 


gctgcacgat 


gtccttgagg 

3 3> 3 3 


aggagactga 


tggccacgqg 
33 333 


cagccccttg 


gtgtaqgtqt 
3 3 3 3 3 


4260 


tgacgaacct 


gttgagctgg 


gagggatgca 
3 339 3 


tgcgggggga 


gatgagatgc 


atcttggcct 


4320 


ggatcttgag 


attggcgatg 

w 33 3 3 


ttcccgccca 


qatccccrccq 


agqattcatg 


ttatqcaaaa 


4380 


ccaccagcac 


qqtgtatcccr 

3 3 3 *~ vvv 3 


atacacttaa 


aqaatttat c 

*J *J *^ V V >J Np» Sp* 


atcrcaact fccr 


aaaaaaaaaa 


4440 


cgtgaaagaa 


tttqqaqacq 

33** 3 3 


cccttgtgac 


cgcccaggtt 


ttccatgcac 


tcatccataa 


4500 


tgatggcgat 


gggcccgtgg 
333 ^3^33 


qcggcggcct 
3 33 33 


gggcaaaqac 
333 3 ^ w 


gtttcqqqqg 


tcggacacat 


4560 


cgtagttgtg 


gtcctgggtg 
3 333 3 


agctcgtcat 


aggccatttt 


aatgaatttg 


gggcggaggg 

333 w 33 , *333 


4620 


tgcccgactg 


ggggacgaag 
3733 3 


gtgccctcga 


tcccggggqc 
33333 


gtagttgccc 


tcgcagatct 


4680 


gcatctccca 


ggccttgagc 
33 33 


tcggaggggg 
^^33^^3333 


ggatcatgtc 


ca cct gcqqq 

3 3 3 9 


gcgatgaaaa 


4740 


aaacggtttc 


cggggcgggg 

"3333 3333 


gagatgaqct 

3 3 ^ 3 3 


qggccgaaag 

i?33 3 3 


caqqttccqq 


aqcagctgqq 

3 3 ^^333 


4800 


acttgccgca 


accqgtqggq 


ccgtagatga 


ccccgatgac 


cagctgcaag 
w 3 3 ^ 3 3 3 


3 3 ^ B 3 ** *^3 


4860 


qgqaqaqaca 


gctgccgtcc 


tcgcgqagaa 

w 3 w 3 3 W 33** 


aoaqaaccac 
33 3333 wn * ww 


ct cgttcatc 


atctcgcgca 


4920 


catgcatgtt 


ctcgcgcacg 


agttccgcca 


qqaqqcqctc 

33"""33^3 *'* 


gccccccagc 


aaaaaoaact 


4980 


cttgcagcga 


ggcgaagttt 
33 3 3 


ttcagcggct 


tqaqtccqtc 


qaccatagoc 

3 3 "333 


attttggaga 


5040 


qqqtctqttg 


caagagttcc 


agacggtccc 


**3**3 ^- ww 3 3 


cratcrtcrctcfc 


acracrcatctc 


5100 


gatccagcag 


acctcctcgt 


ttcgcgggtt 


ggggcgactg 


cgggagtagg 


gcaccaggcg 


5160 


atgggcgtcc 


agcgaggcca 


ggg'tccggtc 


cttccagggc 


cgcagggtcc 


gcgtcagcgt 


5220 


ggtctccgtc 


acggtgaagg 


ggtgcgcgcc 


gggctgggcg 


cttgcgaggg 


tgcgcttcag 


5280 
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gctcatccgg ctggtcgaga accgctcccg gtcggcgccc tgcgcgtcgg ccaggtagca 5340 

attgagcatg agttcgtagt tgagcgcctc ggccgcgtgg cccttggcgc ggagcttacc 5400 

tttggaagtg tgtccgcaga cgggacagag gagggacttg agggcgtaga gcttgggggc 5460 

gaggaagacg gactcggggg cgtaggcgtc cgcgccgcag ctggcgcaga cggtctcgca 5520 

ctccacgagc caggtgaggt cggggcggtt ggggtcaaaa acgaggtttc ctccgtgctt 5580 

tttgatgcgt ttcttacctc tggtctccat gagctcgtgt ccccgctggg tgacaaagag 5640 

gctgtccgtg tccccgtaga ccgactttat gggccggtcc tcgagcgggg tgccgcggtc 5700 

ctcgtcgtag aggaaccccg cccactccga gacgaaggcc cgggtccagg ccagcacgaa 5760 

ggaggccacg tgggaggggt agcggtcgtt gtccaccagc gggtccacct tctccagggt 5820 

atgcaagcac atgtccccct cgtccacatc caggaaggtg attggcttgt aagtgtaggc 5880 

cacgtgaccg ggggtcccgg ccgggggggt ataaaagggg gcgggcccct gctcgtcctc 5940 

actgtcttcc ggatcgctgt ccaggagcgc cagctgttgg ggtaggtatt ccctctcgaa 6000 

ggcgggcatg acctcggcac tcaggttgtc agtttctaga aacgaggagg atttgatatt 6060 

gacggtgccg ttggagacgc ctttcatgag cccctcgtcc atttggtcag aaaagacgat 6120 

ctttttgttg tcgagcttgg tggcgaagga gccgtagagg gcgttggaga gcagcttggc 6180 

gatggagcgc atggtctggt tcttttcctt gtcggcgcgc tccttggcgg cgatgttgag 6240 

ctgcacgtac tcgcgcgcca cgcacttcca ttcggggaag acggtggtga gctcgtcggg 6300 

cacgattctg acccgccagc cgcggttgtg cagggtgatg aggtccacgc tggtggccac 6360 

ctcgccgcgc aggggctcgt tggtccagca gaggcgcccg cccttgcgcg agcagaaggg 6420 

gggcagcggg tccagcatga gctcgtcggg ggggtcggcg tccacggtga agatgccggg 6480 

caggagctcg gggtcgaagt agctgatgca ggtgcccaga ttgtccagcg ccgcttgcca 6540 

gtcgcgcacg gccagcgcgc gctcgtaggg gctgaggggc gtgccccagg gcatggggtg 6600 

cgtgagcgcg gaggcgtaca tgccgcagat gtcgtagacg tagaggggct cctcgaggac 6660 

gccgatgtag gtggggtagc agcgcccccc gcggatgctg gcgcgcacgt agtcgtacag 6720 

ctcgtgcgag ggcgcgagga gccccgtgcc gaggttggag cgttgcggct tttcggcgcg 6780 

gtagacgatc tggcggaaga tggcgtggga gttggaggag atggtgggcc tttggaagat 6840 

gttgaagtgg gcgtggggca ggccgaccga gtccctgatg aagtgggcgt aggagtcctg 6900 

cagcttggcg acgagctcgg cggtgacgag gacgtccagg gcgcagtagt cgagggtctc 6960 
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ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 7020 

ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 7080 

gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 7140 

ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 7200 

catgaccttg aggaactggt gcttgaagtc gaggtcgtcg cagccgccct gctcccagag 7260 

ttggaagtcc gtgcgcttct tgtaggcggg gttaggcaaa gcgaaagtaa catcgttgaa 7320 

gaggatcttg cccgcgcggg gcatgaagtt gcgagtgatg cggaaaggct ggggcacctc 7380 

ggcccggttg ttgatgacct gggcggcgag gacgatctcg tcgaagccgt tgatgttgtg 7440 

cccgacgatg tagagttcca cgaatcgcgg gcggcccttg acgtggggca gcttcttgag 7500 

ctcgtcgtag gtgagctcgg cggggtcgct gagcccgtgc tgctcgaggg cccagtcggc 7560 

gacgtggggg ttggcgctga ggaaggaagt ccagagatcc acggccaggg cggtctgcaa 7620 

gcggtcccgg tactgacgga actgttggcc cacggccatt ttttcggggg tgacgcagta 7680 

gaaggtgcgg gggtcgccgt gccagcggtc ccacttgagc tggagggcga ggtcgtgggc 7740 

gagctcgacg agcggcgggt ccccggagag tttcatgacc agcatgaagg ggacgagctg 7800 

cttgccgaag gaccccatcc aggtgtaggt ttccacatcg taggtgagga agagcctttc 7860 

ggtgcgagga tgcgagccga tggggaagaa ctggatctcc tgccaccagt tggaggaatg 7920 

gctgttgatg tgatggaagt agaaatgccg acggcgcgcc gagcactcgt gcttgtgttt 7980 

atacaagcgt ccgcagtgct cgcaacgctg cacgggatgc acgtgctgca cgagctgtac 8040 

ctgggttcct ttggcgagga atttcagtgg gcagtggagc gctggcggct gcatctcgtg 8100 

ctgtactacg tcttggccat cggcgtggcc atcgtctgcc tcgatggtgg tcatgctgac 8160 

gagcccgcgc gggaggcagg tccagacctc ggctcggacg ggtcggagag cgaggacgag 8220 

ggcgcgcagg ccggagctgt ccagggtcct gagacgctgc ggagtcaggt cagtgggcag 8280 

cggcggcgcg cggttgactt gcaggagctt ttccagggcg cgcgggaggt ccagatggta 8340 

cttgatctcc acggcgccgt tggtggctac gtccacggct tgcagggtgc cgtgcccctg 8400 

gggcgccacc accgtgcccc gtttcttctt gggcgctgct tccatgtcgg tcagaagcgg 8460 

cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 

ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 

gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640 

gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 
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tgccgcagga 


tctcttgcac 


gtcgcccgag 


ttgtcctggt 


aggcgatctc 


ggtcatgaac 


8760 


tgctcgatct 


cctcctcctg 


aaggtctccg 


cggccggcgc 


gctcgacggt 


ggccgcgagg 


8820 


tcgttggaga 


tgcggcccat 


gagctgcgag 


aaggcgttca 


tgccggcctc 


gttccagacg 


8880 


cggctgtaga 


ccacggctcc 


gtcggggtcg 


cgcgcgcgca 


tgaccacctg 


ggcgaggttg 


8940 


agctcgacgt 


ggcgcgtgaa 


gaccgcgtag 


ttgcagaggc 


gctggtagag 


gtagttgagc 


9000 


gtggtggcga 


tgtgctcggt 


gacgaagaag 


tacatgatcc 


agcggcggag 


cggcatctcg 


9060 


ctgacgtcgc 


ccagggcttc 


caagcgttcc 


atggcctcgt 


agaagtccac 


ggcgaagttg 


9120 


aaaaactggg 


agttgcgcgc 


cgagacggtc 


aactcctcct 


ccagaagacg 


gatgagctcg 


9180 


gcgatggtgg 


cgcgcacctc 


gcgctcgaag 


gccccggggg 


gctcctcttc 


catctcctcc 


9240 


tcttcctcct 


ccactaacat 


ctcttctact 


tcctcctcag 


gaggcggtgg 


cgggggaggg 


9300 


gccctgcgtc 


gccggcggcg 


cacgggcaga 


cggtcgatga 


agcgctcgat 


ggtctccccg 


9360 


cgccggcgac 


gcatggtctc 


ggtgacggcg 


cgcccgtcct 


cgcggggccg 


cagcatgaag 


9420 


acgccgccgc 


gcatctccag 


gtggccgccg 


ggggggtctc 


cgttgggcag 


ggagagggcg 


9480 


ctgacgatgc 


atcttatcaa 


ttgacccgta 


gggactccgc 


gcaaggacct 


gagcgtctcg 


9540 


agatccacgg 


gatccgaaaa 


ccgctgaacg 


aaggcttcga 


gccagtcgca 


gtcgcaaggt 


9600 


aggctgagcc 


cggtttcttg 


ttcttcgggt 


atttggtcgg 


gaggcgggcg 


ggcgatgctg 


9660 


ctggtgatga 


agttgaagta 


ggcggtcctg 


agacggcgga 


tggtggcgag 


gagcaccagg 


9720 


tccttgggcc 


cggcttgctg 


gatgcgcaga 


cggtcggcca 


tgccccaggc 


gtggtcctga 


9780 


cacctggcga 


ggtccttgta 


gtagtcctgc 


atgagccgct 


ccacgggcac 


ctcctcctcg 


9840 


cccgcgcggc 


cgtgcatgcg 


cgtgagcccg 


aacccgcgct 


gcggctggac 


gagcgccagg 


9900 


tcggcgacga 


cgcgctcggt 


gaggatggcc 


tgctggatct 


gggtgagggt 


ggtctggaag 


9960 


tcgtcgaagt 


cgacgaagcg 


gtggtaggct 


ccggtgttga 


tggtgtagga 


gcagttggcc 


10020 


atgacggacc 


agttgacggt 


ctggtggccg 


ggtcgcacga 


gctcgtggta 


cttgaggcgc 


10080 


gagtaggcgc 


gcgtgtcgaa 


gatgtagtcg 


ttgcaggcgc 


gcacgaggta 


ctggtatccg 


10140 


acgaggaagt 


gcggcggcgg 


ctggcggtag 


agcggccatc 


gctcggtggc 


gggggcgccg 


10200 


ggcgcgaggt 


cctcgagcat 


gaggcggtgg 


tagccgtaga 


tgtacctgga 


catccaggtg 


10260 


atgccggcgg 


cggtggtgga 


ggcgcgcggg 


aactcgcgga 


cgcggttcca 


gatgttgcgc 


10320 


agcggcagga 


agtagttcat 


ggtggccgcg 


gtctggcccg 


tgaggcgcgc 


gcagtcgtgg 


10380 
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atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 

gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 

ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 

ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 

ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 

cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 

cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 

ggagcgagcc cctctttttt tttcttgtgt ttttgccaga tgcatcccgt actgcggcag 10860 

atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920 

ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980 

gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtqgtcg 11040 

ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100 

cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160 

cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220 

ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280 

ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340 

aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400 

gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460 

ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520 

gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580 

gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gctgagtctg 11640 

ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700 

aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760 

ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820 

agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880 

gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940 

gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000 

ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060 

ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120 
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acgattggac 


ccaggccatg 


caacgcatca 


tggcgctgac 

77 3 


gacccgcaat 


cccgaagcct 


12180 


ttagacagca 


gcctcaggcc 


aaccggctct 


cggccatcct 


ggagaccatcr 

37^*33 7 3 


gtgccctcgc 


12240 


gctcgaaccc 


cacgcacgag 


aaggtgctgg 


ccatcgtgaa 


cgcgctgqtq 

7 7 73 3 


gagaacaagg 

7 3 7 3 


12300 


ccatccgcgg 


tgacgaggcc 


gggctggtgt 

»i7^ 37i7 3 


acaacgcgct 


gctggagcgc 

3^ w 33^*3^7 


gtggcccgct 

3^3 3^^ 3 


12360 


acaacagcac 


caacgtgcag 


acgaacctgg 


accgcatggt 


gaccgacgtg 

3 3 3 7 


cgcgaggcgg 

3 3 7 3 3 3 


12420 


tgtcacaacg 

7 '" 3J 3 


cgagcggttc 


caccgcgagt 


coaacctggg 


ctccatggtg 


gcgctgaacg 


12480 


ccttcctaaa 


cacgcagccc 


crccaaccrtcfc 


cccaaaacca 


aaaaaactac 

*J ** \£ *0 w ** w 


accaacttca 


12540 


tcagcgcgct 


accractcrata 

3 w 3 57 ^ 3 ** ^ 3 


ataaccaaaa 


tgccccagag 


caaaatatac 

3 7 w 3 w " w 


caqtcqqqqc 

7 *' v 7333 w 


12600 


cggactactt 


cttccagacc 


agtcgccagg 

7 ~" 3 3 3 


gcttgcagac 


cgtgaacctg 


agccaggctt 


12660 


tcaagaactt 


gcagggactg 


tggqgcgtgc 

3333^3 3 


aqqccccqqt 


cqqqqaccqc 

w 73333 


gcgacggtgb 

3 3 33 7 


12720 


cgagcetgct 


gacgccgaac 


tcgcgcctgc 


tgctgctgct 


ggtggcgccc 


ttcacggaca 


12780 




aaoccacaac 


tcgtacctgg 


gctacctgct 


taacctgtac 


cgcgaggcca 


12840 


tcaaacaaqc 


crcacatcraac 


gagcagacct 


accaggagat 


cacccacgtg 


agccgcgcgc 

7 w 3 3 3 


12900 


taaaccaaaa 


*^ >j v 


aacctaaaaq 


ccaccctgaa 


cttcctgctg 


accaaccggt 


12960 


cacaoaaaat 


cccaccccao 


tacacactaa 


Qcaccoactaa 


aaaacacatc 

7 7 u 3 w 7 W5 * w w 


ctgcgctacg 


13020 


tgcagcagag 


cgtggggctg 

^3 '■»3333^> , ^3 


ttcctgatgc 


aggaggqggc 

3337333 


cacgcccagc 


gcggcgctcg 

3 33-"3'^*- 3 


13080 


acatgaccgc 


gcgcaacatg 


gagcccagca 


tgtacgcccg 


caaccgcccg 


ttcatcaata 


13140 


agctgatgga 


ctacttgcat 


cgggcggccg 

333^33 7 


ccatgaactc 


ggactacttt 


accaacgcca 


13200 


tcttgaaccc 


gcactggctc 


ccgccgcccg 


ggttctacac 


gggcgagtac 


gacatgcccg 


13260 


accccaacga 


cgggttcctg 

3 3 3 ~ 


tgggacgacg 

3 3 5? 3 3 


tggacagcag 


cgtgttctcg 


ccgcgtccag 


13320 


gaaccaatgc 


cgtgtggaag 

i7 3 3 3 3 


aaagagggcg 


gggaccggcg 

7 3 7 7 3 3 


gccgtcctcg 


gcgctgtccg 

3 3 3 3 


13380 


gtcgcgcggg 

3 3 3 a? s 


tgctgccgcg 

^3 T3 *7 s7 


gcggtgcccg 

3 3 3 ^3 w 3 


aggccgccag 

73 ^ 3 ^ ™" ^ 


ccccttcccg 


agcctgccct 


13440 


tttcgctgaa 


caqcatqcqc 

W **» W T7 V ~ 


agcagcgagc 

3 7 3 3 


tgggtcggct 


gacgcgaccg 


cgcctgctgg 


13500 


acaaqqaqaa 


gtacctgaac 


gactccttgt 


tgaggcccga 


gcgcgagaag 


aacttcccca 


13560 


ataacgggat 


agagagcctg 


gtggacaaga 


tgagccgctg 


gaagacgtac 


gcgcacgagc 


13620 


acagggacga 


gccccgagct 


agcagcgcag 


gcacccgtag 


acgccagcgg 


cacgacaggc 


13680 


agcggggact 


ggtgtgggac 


gatgaggatt 


ccgccgacga 


cagcagcgtg 


ttggacttgg 


13740 


gtgggagtgg 


tggtaacccg 


ttcgctcacc 


tgcgcccccg 


tatcgggcgc 


ctgatgtaag 


13800 
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aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860 

ctgttgtttg tagtagt atg atg agg cgc gtg tac ccg gag ggt cct cct 13910 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro 
1 5 10 

ccc teg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag 13958 
Pro Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin 
15 20 25 



ccc ccg ctg gag gcg cct tac gtg ccc ccg egg tac ctg gcg cct acg 
Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr 
30 35 40 



gec ate ate gac aac tac ttg gcg gtg ggg egg cag aac ggg gtg ctg 
Ala He He Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu 
190 195 200 



14006 



14150 



14198 



gag ggg egg aac age att cgt tac teg gag ctg gca ccc ttg tac gat 14054 
Glu Gly Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp 
45 50 55 

acc acc egg ttg tac ctg gtg gac aac aag teg gca gac ate gec teg 14102 
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser 
60 " 65 70 75 

ctg aac tac cag aac gac cac age aac ttc ctg acc acc gtg gtg cag 
Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin 
80 85 90 

aac aac gat ttc acc ccc acg gag gec age acc cag acc ate aac ttt 
Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe 
95 100 105 

gac gag cgc teg egg tgg ggc ggc cag ctg aaa acc ate atg cac acc 14246 
Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr He Met His Thr 
110 " 115 120 

aac atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg 14294 
Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala 
125 130 135 

egg gtg atg gtc teg cgc aag acc ccc aac ggg gtg gat gat gat tat 14342 
Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr 
140 145 150 155 

gat ggt agt cag gac gag ctg acc tac gag tgg gtg gag ttt gag ctg 14390 
Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu 
160 165 170 

ccc gag ggc aac ttc teg gtg acc atg acc ate gat ctg atg aac aac 14438 
Pro Glu Gly Asn Phe Ser Val Thr Met Thr He Asp Leu Met Asn Asn 
175 180 185 



14486 



gag age gac ate ggc gtg aag ttc gac acg cgc aac ttc egg ctg ggc 14534 
Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 
205 * 210 215 
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tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac gag 14582 
Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu 
220 225 230 235 



gcc ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc ggc gtg gac ttc 
Ala Phe His Pro Asp lie Val Leu Leu Pro Gly Cys Gly Val Asp Phe 



240 245 250 



14630 



acc gag age cgc etc age aac ctg ctg ggc ate cgc aag egg cag ccc 14678 
Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro 

255 260 "* 265 

ttc cag gag ggc ttc cag ate ctg tac gag gac ctg gag ggg ggc aac 14726 
Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu Gly Gly Asn 
270 275 280 

ate ccc gcg etc ttg gat gtc gaa gcc tac gag aaa age aag gag gat 14774 
He Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp 
285 290 295 

age acc gcc gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag gtg 14822 
Ser Thr Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val 
300 305 310 315 

egg ggc gat aat ttt get age get gcg gca gcg gcc gag gcg get gaa 14870 
Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu 
320 325 330 

acc gaa agt aag *ata gtc ate cag ccg gtg gag aag gac age aag gac 14918 
Thr Glu Ser Lys He Val He Gin Pro Val Glu Lys Asp Ser Lys Asp 
335 340 345 

agg age tac aac gtg etc gcg gac aag aaa aac acc gcc tac cgc age 14966 
Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser 
350 355 360 

tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc tec 15014 
Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser 
365 370 375 

tgg acg ctg etc acc acc teg gac gtc acc tgc ggc gtg gag caa gtc 15062 
Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val 
38 ° 385 390 395 

tac tgg teg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tec 15110 
Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser 
400 405 410 

acg cgt caa gtt age aac tac ccg gtg gtg ggc gcc gag etc ctg ccc 15158 
Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro 
415 420 425 

gtc tac tec aag age ttc ttc aac gag cag gcc gtc tac teg cag cag 15206 
Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin 
430 435 440 
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ctg cgc gcc ttc acc teg etc acg cac gtc ttc aac cgc ttc ccc gag 15254 
Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu 
445 450 455 

aac cag ate etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15302 
Asn Gin lie Leu Val Arg Pro Pro Ala Pro Thr lie Thr Thr Val Ser 
460 465 470 475 

gaa aac gtt cct get etc aca gat cac ggg acc ctg ccg ctg cgc age 15350 
Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser 
480 485 490 

agt ate egg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc 15398 
Ser He Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg 
495 500 505 

acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc gtc 15446 
Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val 
510 " 515 520 

etc teg age cgc acc ttc taa aaaatgtcca ttctcatctc geccagtaat 15497 
Leu Ser Ser Arg Thr Phe 



525 














aaca ccggtt 


crcrcrcfcctaccf 


cacacccaac 


aaaatataco 


aacrcrcactca 


ccaacgctcc 


15557 


accrcaacacc 


cccrtcrccrccft 


GCGcaoci cac 


ttccgcgctc 


cctaaaacac 
3 333 w y w 


cctcaaocKic 


15617 




cacacaccac 


catccaccac 


gtgatcgacc 


aaatcrcrtcrcfc 
a yy u yy ^sy^ 


caaccrcacac 

^3 ^3 w 3 w 3 ^" 


15677 


aactacacgc 


ccgccgccgc 


gcccgtctcc 


accgtggacg 


ccgtcatcga 


cagcgtggtg 


15737 


gccgacgcgc 


gccggtacgc 


ccgcaccaag 


agccggcggc 


ggcgcatcgc 


ccggcggcac 


15797 


cggagcaccc 


ccgccatgcg 


cgcggcgcga 


gccttgctgc 


geagggecag 


gcgcacggga 


15857 


cgcagggcca 


tgetcaggge 


ggccagacgc 


gcggcctccg 


gcagcagcag 


cgccggcagg 


15917 


acccgcagac 


gcgcggccac 


ggcggcggcg 


gcggccatcg 


ccagcatgtc 


ccgcccgcgg 


15977 


cgcggeaacg 


tgtactgggt 


gcgcgacgcc 


gccaccggtg 


tgcgcgtgcc 


cgtgcgcacc 


16037 


cgcccccctc 


gcacttgaag 


atgetgaett 


cgcgatgttg 


atgtgtccca 


gcggcgagga 


16097 


ggatgtccaa 


gegcaaatae 


aaggaagaga 


tgctccaggt 


catcgcgcct 


gagatctacg 


16157 


gccccgcggc 


ggcggtgaag 


gaggaaagaa 


agccccgcaa 


actgaagegg 


gtcaaaaagg 


16217 


acaaaaagga 


ggaggaagat 


gaeggactgg 


tggagtttgt 


gcgcgagttc 


gccccccggc 


16277 


ggcgcgtgca 


gtggcgcggg 


cggaaagtga 


aaceggtget 


gcggcccggc 


accacggtgg 


16337 


tcttcacgcc 


eggegagegt 


tccggctccg 


cctccaagcg 


ctcctacgac 


gaggtgtacg 


16397 


gggacgagga 


catcctcgag 


caggeggteg 


agcgtctggg 


cgagtttgcg 


taeggcaage 


16457 


gcagccgccc 


cgcgcccttg 


aaagaggagg 


cggtgtccat 


cccgctggac 


cacggcaacc 


16517 
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ccacgccgag cctgaagccg gtgaccctgc agcaggtgct accgagcgcg gcgccgcgcc 16577 

ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac catgcagctg atggtgccca 16637 

agcgccagaa gctggaggac gtgctggagc acatgaaggt ggaccccgag gtgcagcccg 16697 

aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct gggcgtgcag accgtggaca 16757 

tcaagatccc cacggagccc atggaaacgc agaccgagcc cgtgaagccc agcaccagca 16817 

ccatggaggt gcagacggat ccctggatgc cagcaccagc ttccaccagc actcgccgaa 16877 

gacgcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 16937 

tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg ctacaccagc agccgccgcc 16997 

gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag cagcaccgcg acttccgcct 17057 

tggtgcggag agtgtatcgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17117 

acccgagcat cgccatttaa ctaccgcctc ctacttgcag atatggccct cacatgccgc 17177 

ctccgcgtcc ccattacggg ctaccgagga agaaagccgc gccgtagaag gctgacgggg 17237 

aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga 17297 

ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga tcggggcgat ccccggcata 17357 

gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca caaaaaagca tggatttgta 17417 

ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg tgtgttttta gatggaagac 17477 

atcaattttt cgtccctggc accgcgacac ggcacgcggc cgtttatggg cacctggagc 17537 

gacatcggca acagccaact gaacgggggc gccttcaatt ggagcagtct ctggagcggg 17597 

cttaagaatt tcgggtccac gctcaaaacc tatggcaaca aggcgtggaa cagcagcaca 17657 

gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc agcagaaggt ggttgatggc 17717 

ctggcctcag gcatcaacgg ggtggttgac ctggccaacc aggccgtgca gaaacagatc 17777 

aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg agatgcccca ggtggaggag 17837 

gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc gtcccgacgc ggaggagacg 17897 

ctgctgacgc acacggacga gccgcccccg tacgaggagg cggtgaaact gggcctgccc 17957 

accacgcggc ccgtggcgcc tctggccacc ggagtgctga aacccagcag cagccagccc 18017 

gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg ctaagcccct gccgccggtg 18077 

gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga actggcagag cactctgaac 18137 

agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aaagacactg 18197 
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tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg ccgaccagaa ggaggagtgt 18257 

gaagaggcgc gtcgccgagt tgcaag atg gcc acc cca teg atg ctg ccc cag 18310 

Met Ala Thr Pro Ser Met Leu Pro Gin 
530 535 

tgg gcg tac atg cac ate gcc gga cag gac get teg gag tac ctg agt 18358 
Trp Ala Tyr Met His lie Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser 
540 545 550 

ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg 18406 
Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu 
555 560 565 570 

ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18454 
Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr 
575 580 585 

acc gac cgc age cag egg ctg acg ctg cgc ttc gtg ccc gtg gac cgc 18502 
Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg 
590 595 600 

gag gac aac acc tac teg tac aaa gtg cgc tac acg ctg gcc gtg ggc 18550 
Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly 
605 610 615 

gac aac cgc gtg ctg gac atg gcc age acc tac ttt gac ate cgc ggc 18598 
Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp He Arg Gly 
620 6.25 630 

gtg ctg gac egg ggc cct age ttc aaa ccc tac tct ggc acc gcc tac 18646 
Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr 
635 640 ' 645 ~ 650 

aac age eta get ccc aag gga get ccc aat tec age cag tgg gag caa 18 694 
Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin 
655 660 665 

gca aaa aca ggc aat ggg gga act atg gaa aca cac aca tat ggt gtg 18742 
Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val 
670 675 680 

gcc cca atg ggc gga gag aat att aca aaa gat ggt ctt caa att gga 18790 
Ala Pro Met Gly Gly Glu Asn He Thr Lys Asp Gly Leu Gin He Gly 
685 690 695 

act gac gtt aca gcg aat cag aat aaa cca att tat gcc gac aaa aca 18838 
Thr Asp Val Thr Ala Asn Gin Asn Lys Pro He Tyr Ala Asp Lys Thr 
700 705 710 

ttt caa cca gaa ccg caa gta gga gaa gaa aat tgg caa gaa act gaa 18886 
Phe Gin Pro Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu 
715 720 725 730 

aac ttt tat ggc ggt aga get ctt aaa aaa gac aca aac atg aaa cct 18934 
Asn Phe Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro 
735 740 745 
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tgc tat ggc tec tat get aga ccc acc aat gaa aaa gga ggt caa get 18982 
Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala 
750 755 760 

aaa ctt aaa gtt gga gat gat gga gtt cca acc aaa gaa ttc gac ata 19030 
Lys Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp lie 
765 770 ' 775 

gac ctg get ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa gac 19078 
Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp 
780 785 790 

gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat ttg gaa 19126 
Glu Tyr Lys Ala Asp lie Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu 
795 800 805 810 

act cca gac acg cat gtg gta tac aaa cca ggc aag gat gat gca agt 19174 
Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser 
815 820 825 

tct gaa att aac ctg gtt cag cag tct atg ccc aac aga ccc aac tac 19222 
Ser Glu lie Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr 
830 835 840 

att ggg ttc agg gac aac ttt ate ggt ctt atg tac tac aac age act 19270 
lie Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr Asn Ser Thr 
845 850 855 

ggc aat atg ggt gtg ctt get ggt cag gec tec cag ctg aat get gtg 19318 
Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val 
860 865 870 

gtt gat ttg caa gac aga aac acc gag ctg tec tac cag etc ttg ctt 19366 
Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu 
875 880 885 890 

gac tct ttg ggt gac aga acc egg tat ttc agt atg tgg aac cag gcg 19414 
Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn ,Gln Ala 
895 900 905 

gtg gac agt tat gac ccc gat gtg cgc ate ate gaa aac cat ggt gtg 19462 
Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His Gly Val 
910 915 920 

gag gat gaa ttg cca aac tat tgc ttc ccc ttg gac ggc tct ggc act 19510 
Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr 
925 930 935 

aac gee gca tac caa ggt gtg aaa gta aaa gat ggt caa gat ggt gat 19558 
Asn Ala Ala Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp 
940 945 950 

gtt gag agt gaa tgg gaa aat gac gat act gtt gca get cga aat caa 19606 
Val Glu Ser Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin 
955 960 965 970 
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tta tgt aaa ggt aac att ttc gcc atg gag att aat etc cag get aac 
Leu Cys Lys Gly Asn lie Phe Ala Met Glu lie Asn Leu Gin Ala Asn 
975 980 985 



19654 



ctg tgg aga agt ttc etc tac teg aac gtg gcc ctg tac ctg ccc gac 
Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp 
990 995 1000 



19702 



tec tac aag tac acg ccg acc aac gtc acg ctg ccg acc aac acc 
Ser Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr 
1005 1010 1015 



19747 



aac acc tac gat tac atg aat ggc aga gtg aca cct ccc teg ctg 
Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu 
1020 1025 1030 



19792 



gta gac gcc tac etc aac .ate ggg gcg cgc tgg teg ctg gac ccc 
Val Asp Ala Tyr Leu Asn lie Gly Ala Arg Trp Ser Leu Asp Pro 
1035 1040 1045 



19837 



atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg 
Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu 
1050 1055 1060 



19882 



cgc tac cgc tec atg etc ctg ggc aac ggg cgc tac gtg ccc ttc 
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe 
1065 1070 ~ 1075 



19927 



cac ate cag gtg ccc caa aag ttt ttc gcc ate aag age etc ctg 
His He Gin Val Pro Gin Lys Phe Phe Ala He Lys Ser Leu Leu 
1080 1085 1090 



19972 



etc ctg ccc ggg tec tac acc tac gag tgg aac ttc cgc aag gac 
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp 
1095 1100 1105 



20017 



gtc aac atg ate ctg cag age tec eta ggc aac gac ctg cgc acg 
Val Asn Met He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Thr 
1110 1115 1120 



20062 



gac ggg gcc tec ate gcc ttc acc age ate aac etc tac gcc acc 
Asp Gly Ala Ser He Ala Phe Thr Ser He Asn Leu Tyr Ala Thr 
1125 1130 1135 



20107 



ttc ttc ccc atg gcg cac aac acc gcc tec acg etc gag gcc atg 
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met 
1140 1145 1150 



20152 



ctg cgc aac gac acc aac gac cag tec ttc aac gac tac etc teg 
Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser 
1155 1160 1165 



20197 



gcg gcc aac atg etc tac ccc ate 
Ala Ala Asn Met Leu Tyr Pro He 
1170 1175 



ccg gcc aac gcc acc aac gtg 
Pro Ala Asn Ala Thr Asn Val 
1180 



20242 
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ccc ate tec ate cr-c teg cgc aac tgg gec gec ttc cgc gga tgg 
Pro lie Ser He Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp 
1185 1190 1195 



20287 



tec ttc acg cgc ctg aag acc cgc gag acg ccc teg etc ggc tec 
Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser 
1200 1205 1210 



20332 



ggg ttc gac 
Gly Phe Asp 
1215 



ccc tac ttc gtc tac teg ggc tec ate ccc tac eta 
Pro Tyr Phe Val Tyr Ser Gly Ser He Pro Tyr Leu 
1220 1225 



20377 



gac ggc acc ttc tac etc aac cac acc ttc aag aag gtc tec ate 
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He 
1230 1235 1240 



20422 



acc ttc gac tec tec gtc age tgg ccc ggc aac gac cgc etc ctg 
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 
1245 1250 1255 



20467 



acg ccc aac gag ttc gaa ate aag cgc acc gtc gac gga gag gga 
Thr Pro Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly 
1260 1265 1270 



20512 



tac aac gtg gee cag tgc aac atg acc aag gac tgg ttc ctg gtc 
Tyr Asn Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val 
1275 1280 1285 



20557 



cag atg ctg gec cac tac aac ate 
Gin Met Leu Ala His Tyr Asn He 
1290 1295 



ggc tac cag ggc ttc tac gtg 
Gly Tyr Gin Gly Phe Tyr Val 
1300 



20602 



ccc gag ggc 
Pro Glu Gly 
1305 



tac aag gac cgc atg tac tec ttc ttc cgc aac ttc 
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe 
1310 1315 



20647 



cag ccc atg age cgc cag gtc gtg gac gag gtc aac tac aag gac 
Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn Tyr Lys Asp 
1320 1325 1330 



20692 



tac cag gee gtc acc ctg gec tac cag cac aac aac teg ggc ttc 
Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser Gly Phe 
1335 1340 1345 



20737 



gtc ggc tac etc gcg ccc acc atg cgc cag ggc cag ccc tac ccc 
Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr Pro 
1350 1355 1360 



20782 



gec aac tac ccc tac ccg etc ate ggc aag age gec gtc gee age 
Ala Asn Tyr Pro Tyr Pro Leu He Gly Lys Ser Ala Val Ala Ser 
1365 1370 1375 



20827 



gtc acc cag aaa aag ttc etc tgc gac egg gtc atg tgg cgc ate 
Val Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg He 
1380 1385 1390 



20872 
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ccc ttc tec age aac ttc atg tec atg ggc gcg etc acc gac etc 20917 
Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu 
1395 1400 1405 

ggc cag aac atg etc tac gee aac tec gee cac gcg eta gac atg 20962 
Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met 
1410 1415 1420 

aat ttc gaa gtc gac ccc atg gat gag tec acc ctt etc tat gtt 21007 
Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val 
1425 1430 1435 

gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc 21052 
Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His Arg 
1440 1445 1450 

ggc gtc ate gaa gec gtc tac ctg cgc acg ccc ttc teg gee ggc 21097 
Gly Val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 
1455 1460 1465 

aac gee acc acc taa gccgctcttg ettcttgeaa gatgaeggeg ggctccggcg 21152 
Asn Ala Thr Thr 
1470 



agcaggagct 


cagggccatc 


ctccgcgacc 


tgggctgegg 


gccctgcttc ctgggcacct 


21212 


tegacaageg 


cttccctgga 


ttcatggccc 


cgcacaagct 


ggcctgcgcc ategtgaaca 


21272 


cggccggccg 


egagaceggg 


ggfcgagcact 


ggctggcctt 


cgcctggaac .ccgcgctccc 


21332 


acacatgeta 


cctcttcgac 


cccttcgggt 


teteggaega 


gcgcctcaag cagatctacc 


21392 


agttcgagta 


egagggectg 


ctgcgtcgca 


gcgccctggc 


caecgaggac cgctgcgtca 


21452 


ccctggaaaa 


gtccacccag 


acegtgeagg 


gtccgcgctc 


ggccgcctgc gggctcttct 


21512 


gctgcatgtt 


cctgcacgcc 


ttcgtgcact 


ggcccgaecg 


ccccatggac aagaacccca 


21572 


ccatgaactt 


actgaegggg 


gtgcccaacg 


gcatgctcca 


gtcgccccag gtggaaccca 


21632 


ccctgcgccg 


caaccaggaa 


gcgctctacc 


gcttcctcaa 


tgcccactcc gcctactttc 


21692 


gctcccaccg 


cgcgcgcatc 


gagaaggcea 


ccgccttcga 


ccgcatgaat caagacatgt 


21752 


aaaaaacegg 


tgtgtgtatg 


tgaatgcttt 


attcataata 


aacagcacat gtttatgeca 


21812 


ccttctctga 


ggctctgact 


ttatttagaa 


atcgaagggg 


ttctgccggc teteggcatg 


21872 


gcccgcgggc 


agggatacgt 


tgcggaactg 


gtacttgggc 


agecacttga acteggggat 


21932 


cagcagcttg 


ggcacgggga 


ggtcggggaa 


cgagtcgctc 


cacagcttgc gcgtgagttg 


21992 


cagggcgccc 


ageaggtegg 


gegeggagat 


cttgaaatcg 


cagttgggac ccgcgttctg 


22052 


cgegegagag 


ttgeggtaca 


eggggttgea 


gcactggaac 


accatcaggg ccgggtgctt 


22112 


cacgcttgcc 


agcaccgtcg 


cgtcggtgat 


gccctccacg 


tccagatcct cggcgttggc 


22172 
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catcccgaag 


ggggtcatct 


tgcaggtctg 


ccgccccatg 


ctgggcacgc 


agccgggctt 


22232 


gtggttgcaa 


tcgcagtgca 


gggggatcag 


catcatctgg 


gcctgctcgg 


agctcatgcc 


22292 


cgggtacatg 


gccttcatga 


aagcctccag 


ctggcggaag 


gcctgctgcg 


ccttgccgcc 


22352 


ctcggtgaag 


aagaccccgc 


aggacttgct 


agagaactgg 


ttggtggcgc 


agccggcgtc 


22412 


gtgcacgcag 


cagcgcgcgt 


cgttgttggc 


cagctgcacc 


acgctgcgcc 


cccagcggtt 


22472 


ctgggtgatc 


ttggcccggt 


tggggttctc 


cttcagcgcg 


cgctgcccgt 


tctcgctcgc 


22532 


cacatccatc 


tcgatagtgt 


gctccttctg 


gatcatcacg 


gtcccgtgca 


ggcaccgcag 


22592 


cttgccctcg 


gcttcggtgc 


agccgtgcag 


ccacagcgcg 


cagccggtgc 


actcccagtt 


22652 


cttgtgggcg 


atctgggagt 


gcgagtgcac 


gaagccctgc 


aggaagcggc 


ccatcatcgc 


22712 


ggtcagggtc 


ttgttgctgg 


tgaaggtcag 


cgggatgccg 


cggtgctcct 


cgttcacata 


22772 


caggtggcag 


atgcggcggt 


acacctcgcc 


ctgctcgggc 


atcagctgga 


aggcggactt 


22832 


caggtcgctc 


tccacgcggt 


accggtccat 


cagcagcgtc 


atcacttcca 


tgcccttctc 


22892 


ccaggccgaa 


acgatcggca 


ggctcagggg 


gttcttcacc 


gccattgtca 


tcttagtcgc 


22952 


cgccgccgag 


gtcagggggt 


cgttctcgtc 


cagggtctca 


aacactcgct 


tgccgtcctt 


23012 


ctcgatgatg 


cgcacggggg 


gaaagctgaa 


gcccacggcc 


gccagctcct 


cctcggcctg 


23072 


cctttcgtcc 


tcgctgtcct 


ggctgatgtc 


ttgcaaaggc 


acatgcttgg 


tcttgcgggg 


23132 


tttctttttg 


ggcggcagag 


gcggcggcga 


tgtgctggga 


gagcgcgagt 


tctcgttcac 


23192 


cacgactatt 


tcttcttctt 


ggccgtcgtc 


cgagaccacg 


cggcggtagg 


catgcctctt 


23252 


ctggggcaga 


ggcggaggcg 


acgggctctc 


gcggttcggc 


gggcggctgg 


cagagccccf 


23312 


tccgcgttcg 


ggggtgcgct 


cctggcggcg 


ctgctctgac 


tgacttcctc 


cgcggccggc 


23372 


cattgtgttc 


tcctagggag 


caacaacaag 


catggagact 


cagccatcgt 


cgccaacatc 


23432 


gccatctgcc 


cccgccgcca 


ccgccgacga 


gaaccagcag 


cagaatgaaa 


gcttaaccgc 


23492 


cccgccgccc 


agccccacct 


ccgacgccgc 


ggccccagac 


atgcaagaga 


tggaggaatc 


23552 


catcgagatt 


gacctgggct 


acgtgacgcc 


cgcggagcac 


gaggaggagc 


tggcagcgcg 


23612 


cttttcagcc 


ccggaagaga 


accaccaaga 


gcagccagag 


caggaagcaq 


agaacgagca 


23672 


gaaccaggct 


gggcacgagc 


atggcgacta 


cctgagcggg 


gcagaggacg 


tgctcatcaa 


23732 


gcatctggcc 


cgccaatgca 


tcatcgtcaa 


ggacgcgctg 


ctcgaccgcg 


ccgaggtgcc 


23792 


cctcagcgtg 


gcggagctca 


gccgcgccta 


cgagcgcaac 


ctcttctcgc 


cgcgcgtgcc 


23852 
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ccccaagcgc 


cagcccaacg 


gcacctgtga 


gcccaacccg 


cgcctcaact 


tctacccggt 


23912 


cttcacacftcf 

3 ^73 **y 


cccgaggccc 


tggccaccta 


ccacctcttt 


ttcaagaacc 


aaaggatccc 


23972 


catctcctac 

V^ VJ v »»• w V- M V^ 


caccr ccaacc 


crcaccccrccrc 


cgacgccctg 


Cwcaacctocr 


acccccfcrccfc 


24032 




crata tcacct 

y a w wi w v* w w w 


c ct t o era a era 


aattcccaaa 


atctt caacrcf 

o wwwwwyoyy 


cf t ct cf ci cr ca q 

y ww wy y y w«*\^ 


24092 




PCfCfCf CCCf fda 
w y y y uv*y y a 


a ccrct!C! , f"Cfc , a 


acraaaaccf cr a 


cr a n era cr c* a t cr 
y ay y oy wa wy 


acrcaccacao 

wiy w»o w Wu wj a u 


24152 


ccr ccc tcfertcr 


aaattaaaaa 

y WNJ W ^^y 


Qccracaaccfc 


y wy wwuy y ^y 


Cftcctcaacfc 


acacaatcaa 

y wowyy wwyo 


24212 


gc tgacccac 


ttcgcctacc 


cggcgctcaa 


cctgcccccc 


aaggtcatga 


gcgccgtcat 


24272 


oaaccaacitcr 


ctcatcaagc 


gcgcctcgcc 


cctctcggag 


aaaaaaatac 

yoyyoyo wy w 


aaaacccccja 


24332 


y O V4 V W VH y w» 


aaaaacaaac 

y My y y wwv wiy w 


ccataatcao 


caacaaacacr 

w \d v* *4 ** >j 


eta a co ccfet 


aactcraoacfc 

yy w wy yy oy w 


24392 


ciacztacicacc 

y a y v o y w> c* w w 


ccccaoaacc 

Vrf V^> U U V* ^ W w 


t cr cr a a a a cr c a 


cfcacaaactc 

y «u wo oy w w 


at cf a taacca 

w\ w w o wy y w wy 


tcfcrtcctcrcft 

wyy www wyy w 


24452 


aaccataaaa 


ctaaaatatc 


tgcgccgctt 


ctttgccgac 


crcaaaaaccc 

y *"y y oyowww 


tcfcacaaocrt 


24512 


caaaaaaaac 


ctgcactacc 


tcttcaggca 


caaattcuto 

wy yy w w wy wy 


cgccaggcct 


gcaagatctc 


24572 


woowy r j y 


ctcfaccaacc 

w wy a w wnnvv 


tcrort c t cc ta 


cataaaca tc 

w w^yy w^* w w 


ctacacoaaa 


acccrcctcrcfGf 

owwy wwwy yy 


24632 


acaaaacata 


ctccacacca 


ccctacacaa 


aaaaccccac 

y y oyy w«-wy w» 


cgcgactaca 


tccgcgactg 


24692 


ccrtctaccfccf 

wy wwwowwwy 


ta cctctcrcc 

WO Wj bwwy w W 


acacctcrcfca 


ciacacfc/cat cf 


cr cr ccr t cf t cr a c 

yy wy wy wyy w 


acfcacrtcfcct 

a 3 VB 3 wy www 


24752 


y y oyy ay way 


aarrfnaaaa 


acfctctcrcaa 

ay www L>y wem 


ywwwwwy wo y 


aacraacctca 

wl SL W uLU W W W W* wi 


aaaccctcftcr 

wi w w www by wy 


24812 


cr a c ccr cr er t 1 e 

yavvyyy w w w 


era ccs Aaca ta 

yowyoywy wo 


ccacccfcctc 

w wo wv*y vWVa^ 


cr cr a c c t cro c c 


cacctcatct 

VJ V* V^ V* V^ V4 w V W 


tccccaacfca 

ww ww wy wi y w- y 


24872 


ccterccrcictcr 

W w Wy Wy y W- W y 


a cere tcrccica 

U V* V. y w>y wB 


acaacrctacc 


caactttata 

W V* W WWW %m& W W 


aaccaaarjea 

V4 VJ Vj* V-* VI V4 VI V^ U 


tgttgcaaaa 


24932 


cttteoctct 

W W W WWM w W W W 


ttra t cctccr 

w l>o w w ^ w wy 


aacactccffcf 
u. a ^y y y 


era t c ct cr c c e 

W ws w w w w y w w> w» 


accacctoct 

VJ *w» V>* Wt W» V^ V* VJ v» 


ccgcgctgcc 


24992 


ct ccrcractt c 


gt geegctga 


cctt ccgega 


gtgccccccg 


ccgctctgga 


gccactgcta 


25052 


cttcrctcrccrc 

w w vy w wy **y ^ 


ctcrcr ccaact 

w w y y w« w i_* u w w 


acctggccta 


ccact egg a c 


ataatcaacfcf 

y wy owwyoyy 


acatcacrcaa 


25112 


eoaeracrtctcr 

y a y y y w y 


ctercf acrtcrcc 


a c t ci c ccr c t cr 

vi w» w- y wy w «*y 


caacctctoc 

V* VI \0 Vrf w V W VJ 


acgccgcacc 


cctccctaac 


25172 


chocaacccc 
wwywoowwww 


eaacterctcra 
w u V4 w wy w y 


cr c a a cr a eeea 

y w w *a y w w wo 


aatcatccfcrc 

y ^* w wwv w w y y w 


accttccraat 

V* V^ Vrf V» V» VprfNJ Vft VJ V* 


tacaacraccc 

V* VJ V > V* VJ VJ v^ v* v> 


25232 


acs acetic a etc 
l>u y wy 9 ^y y w 


era cr cr a c a a ercr 
y ^ y y y vuny u 


crcr a cr t c a a a 

y y y y u wy o o 


actcacccca 

o w w wo w w w wy 


cr cf cf c t cf t cf cf a 

y yy wwy wyy a 


cctccfcr ccta 

ww w wy y w>w wo 


25292 


w u uy wy wooy 




acrcra ctACca 

oy yow wo w wo 


tcccttcdan 
w w w w w w wy o y 


a teacrcrfctet 

o w wuy y w w w w 


a ccracrcr a cca 

o w y o y y o w wo 


25352 


aU<viv^ay w w. y 


wwwetcxyyos^y 


ay uuy wwyyw 


ctflcotcatc 

w wy wy w wo w w 


a eceacrcrcfcrcr 

a ww wa y y y y y 


w wa w w w wy y w 


25412 


ecaattgeaa 


gccatccaga 


aatcccgcca 


agaatttctg 


ctgaaaaagg 


gecaeggggt 


25472 


ctacttggac 


ccccagaccg 


gagaggagct 


caaccccagc 


ttcccccagg 


atgccccgag 


25532 


gaagcagcaa 


gaagctgaaa 


gtggagctgc 


cgccgccgga 


ggatttggag 


gaagactggg 


25592 
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agagcagtca 


ggcagaggag 


gaggagatgg 


aagactggga 


cagcactcag 


gcagaggagg 


25652 


acagcctgca 


agacagtctg 


gaggaggaag 


acgaggtgga 


ggaggcagag 


gaagaagcag 


25712 


ccgccgccag 


accgtcgtcc 


tcggcggaga 


aagcaagcag 


cacggatacc 


atctccgctc 


25772 


cgggtcgggg 


tcgcggcggc 


cgggcccaca 


gtaggtggga 


egagaceggg 


cgcttcccga 


25832 


accccaccac 


ccagaccggt 


aagaaggagc 


ggcagggata 


caagtcctgg 


egggggcaca 


25892 


aaaacgccat 


cgtctcctgc 


ttgcaagcct 


gcgggggcaa 


catctccttc 


acccggcgct 


25952 


acctgctctt 


ccaccgcggg 


gtgaacttcc 


cccgcaacat 


ettgeattae 


taccgtcacc 


26012 


tccacagccc 


ctactactgt 


ttccaagaag 


aggcagaaac 


ccagcagcag 


cagaaaaeca 


26072 


gcggcagcag 


cagctagaaa 


atccacagcg 


gcggcaggtg 


gactgaggat 


cgcggcgaac 


26132 


gagccggcgc 


agacccggga 


gctgaggaac 


eggatcttte 


ccaccctcta 


tgccatcttc 


26192 


cagcagagtc 




gcaggaactg 


aaagtcaaga 


accgttctct 


gcgctcgctc 


26252 


acccgcagtt 


gtctgtatca 


caagagcgaa 


gaccaacttc 


agcgcactct 


cgaggacgcc 


26312 


gaggctctct 


tcaacaagta 


ctgcgcgctc 


actcttaaag 


agtagcccgc 


gcccgcccac 


26372 


acacggaaaa 


aggcgggaat 


tacgt caeca 


cctgcgccct 


tcgcccgacc 


atcatgagca 


26432 


aagagattcc 


cacgccttac 


atgtggagct 


accagcccca 


gatgggcctg 


gccgccggcg 


26492 


ccgcccagga 


ctactccacc 


cgcatgaact 


ggctcagtgc 


cgggcccgcg 


atgatctcac 


26552 


gggtgaatga 


catccgcgcc 


caccgaaacc 


agatactcct 


agaacagtca 


gcgatcaccg 


26612 


ccacgccccg 


ccatcacctt 


aatccgcgta 


attggcccgc 


cgccctggtg 


taccaggaaa 


26672 


ttccccagcc 


cacgaccgta 


ctacttccgc 


gagacgccca 


ggccgaagtc 


cagctgacta 


26732 


actcaggtgt 


ccagctggcc 


ggcggcgccg 


ccctgtgtcg 


tcaccgcccc 


gctcagggta 


26792 


taaagcggct 


ggtgatccga 


ggcagaggca 


cacagctcaa 


cgacgaggtg 


gtgagctctt 


26852 


cgctgggtct 


gcgacctgac 


ggagtcttcc 


aactcgccgg 


ateggggaga 


tcttccttca 


26912 


cgcctcgtca 


ggccgtcctg 


actttggaga 


gttcgtcctc 


gcagccccgc 


tegggeggea 


26972 


tcggcactct 


ccagttcgtg 


gaggagttca 


ctccctcggt 


ctacttcaac 


cccttctccg 


27032 


gctcccccgg 


ccactacccg 


gacgagttca 


tcccgaactt 


cgacgccatc 


agegagtegg 


27092 


tggacggcta 


cgattgaatg 


tcccatggtg 


gcgcagctga 


cctagctcgg 


cttcgacacc 


27152 


tggaccactg 


ccgccgcttc 


cgctgcttcg 


ctegggatet 


cgccgagttt 


gectactttg 


27212 


agctgcccga 


ggagcaccct 


cagggcccag 


cccacggagt 


geggatcate 


gtcgaagggg 


27272 



55 



WO 03/046124 PCT/US02/33645 



gcctcgactc 


ccacctgctt 


cggatcttca 


gccagcgacc 


gatcctggtc 


oaaccrcaaac 


27332 


aaggacagac 


ccttcttact 


ttgtactgca 


tctgcaacca 


ccccggcctg 


cataaaacrtc 


27392 


tttgttgtct 


actatatact 


eiacjtataata 


aaacrctcracra 


tcacfcaacta 


ctcccrcractc 


27452 


aattataata 

ywvwyyyy wy 


ttcctgctat 


caaccaatcc 


ctcrttcttca 

w fc» y v w w w w> sob 


ccaooaacaa 


ci a c e* cr a cr et c* 


27512 


cagctccagt 


crtaaacccca 


caacraacrtac 


c t ca c c t cr cc c 


t cr 1 1 c ca cr cr cr 

b>y w Lrwwcvy yy 


ct c c* c* ccr a t c* 


27572 


gccgttgtca 


accactgcga 


caacgacgga 


atcctcr ctaa 


crccfcr ccctcfc 


caaccttact 


27632 


ttttccaccc 


ocaaaaacaa 


gctccagctc 


ttccaaccct 


tcctcccccrcr 

w w w Vp« w w w y y 


cr a c c t a t ca cr 


27692 


tgcg tctcag 


gaccctgcca 


tcacaccttc 


cacctcratcc 


ccraatancac 




21152. 


cccgctacta 


acaaccaaac 


tacccaccaa 


ccfccacccftc 


cr ccracctttc 


ct ct era a tct 

www wy bb ww w 


27812 


aataccacta 


ccaaaacrtaa 

vwyywyy wya 


cr ctcccraercr t 


ccraccaaeet 


c crcr cf a a 

^ uyy y n i» wb 


w wo wy y www w 


27872 


tgggaqqtqq 


taaaattaat 
3333 ww ** ,i * w 


aacactaacrc 


etagttgegg 


OtCfCfCf ctttt 
y t»yyyw^ ivw v 


ercfttct etcrc 

y y w w w w uy w 


27932 


tacctatacc 


tcccttgctg 


ttcgtactta 


otaatoctat 
y w y y w^\*i.yu 


y v. uy wwy y ^, v 


ta a era a at or or 


27992 


ggaagatcac 


cctagtgagc 


tacaaterccrc 


tacftaaccr crt 

vyy i-yyvyy u 


CfttCfCtttCCf 


a 1 1 er t ft cr era c» 

b w wy wy y y a w 


28052 


taaaccaccrc 


acrctatacrta 

33 v w 3 w»y *-y 


aaaaacraaaa 


ccaatcccta 


cttcrca ttt n 


aaterraara 


28112 


aatgccagct 


era cr 1 1 1 1 c a a 

3** 3 WWWWWBy 


cccrra t cr cr ca 


a i" r* ant" a r* rf ^ 
a uuyy Ly uyu 


yyLo^t.y<itu 


dcly Wy wy ya w 


OQ1 TO 


aaaaatacaa 


aaacataaaa 


at ccra eft a ca 

b ^ vywy w b w u. 


ataacaacrac 




a c*t" c*t* rrrorrt 
ct w w w w wy wy w 


28232 


ccgtgtggca 


occcqqagac 


cccaaatacrt 

wvwyay *-y y *- 


acaccgtctc 


tatccccaat 


er c t ci a ccrcr et 

y *» ** y b «*y y w w 


28292 


ccccgcgcac 


cgtgaataat 


actttcattt 


ttqeacacat 


atacaacaca 


atcatcrtaaa 

y fc* wu w y wy wb 


28352 


tgagcaagca 


atacaatata 


taocccccca 


ccfaacfaacraa 


cat cert cfcrtc 

wb tawy uyy WW 


www ww> wb w wy 


28412 


cttacagcct 


atacacaaca 
y ^y *~a.wy y ^y 


ctaatcacccf 

V BU W VB wwy 


ct a t c cr t n t cr 


cctcracrca tt 


ca f*a t cr ct ra 

WO Wb W y W w Wb 


28472 


tccicfcattccf 


ccccacfaaat* 

w WVmVJ B B B W 


a a tcrcc. era era 
aa ^*y y y 


aarrarraaa pa 


cr era t a a ca c 


crttttt-tcac 

yUULUL wwclw 


^ 0 j J4 


acaccttcrtt 


tttacaaaca 


a t cr ccr t c t cf t 


taaattttfct 


a a a c*a ttctt n 
bbb "w*d l> ^-y y 


r*+" ca rt^ a +" *^ 
w w Lay wd w wy 


285Q2 


ct tat erect c 


tocrttataca 


aacatacacta 


aaaecc^fta 

mCICIw^ULL wb 


i.y uayya u> u» t- 


ca f- nrff" a ca 
y d wy y Wb wdw 


9ftfi59 


taaacraatac 


ccaatcacaa 


Cf c ca a a a t "fc ft 


da tcfCftai" t* t 
' — ci uyy K- cl L» k» o 




a a i**a r»^/raf ^ 
daCoCLyatv 




cagttaaact 


ttataaacat 


cfaattaccfTc 

y w* w-4 w w y v»v^y 


cft"aca rataa 
y Lavciva wbb 


a a c*t f ca c 1 1 


acatt+ - arftt• 
bWbWW wctywi. 


28772 


crcacrcaataa 


W B B WVrW UQ Wb 






B v>b Ub LUyy l 


a /^f - 4~ a +* a r»a 
dClLfltta Ca 


288*^2 


gtacaaactt 


tcatacagga 


caagataaat 


attatactgt 


taaggtagaa 


aatcctacca 


28892 


ctcctagaac 


taccaccacc 


accactactg 


caaagcccac 


tgtgaaaact 


acaactagga 


28952 


ccaccacaac 


tacagaaacc 


accaccagca 


caacacttgc 


tgcaactaca 


cacacacaca 


29012 
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ctaagctaac cttacagacc actaatgatt tgatcgccct gctgcaaaag ggggataaca 29072 

gcaccacttc caatgaggag atacccaaat ccatgattgg cattattgtt gctgtagtgg 29132 

tgtgcatgtt gatcatcgcc ttgtgcatgg tgtactatgc cttctgctac agaaagcaca 29192 

gactgaacga caagctggaa cacttactaa gtgttgaatt ttaatttttt agaaccatga 29252 

agatcctagg cctttttagt ttttctatca ttacctctgc tctttgtgaa tcagtggata 29312 

gagatgttac tattaccact ggttctaatt atacactgaa agggccaccc tcaggtatgc 29372 

tttcgtggta ttgctatttt ggaactgaca ctgatcaaac tgaattatgc aattttcaaa 29432 

aaggcaaaac ctcaaactct aaaatctcta attatcaatg caatggcact gatctgatac 29492 

tactcaatgt cacgaaagca tatggtggca gttattattg ccctggacaa aacactgaag 29552 

aaatgatttt ttacaaagtg gaagtggttg atcccactac accacccacc accacaacta 29612 

ttcataccac acacacagaa caaacaccag aggcaacaga agcagagttg gccttccagg 29672 

ttcacggaga ttcctttgct gtcaataccc ctacacccga tcagcggtgt ccggggccgc 29732 

tagtcagcgg cattgtcggt gtgctttcgg gattagcagt cataatcatc tgcatgttca 29792 

tttttgcttg ctgctataga aggctttacc gacaaaaatc agacccactg ctgaacctct 29852 

atgtttaatt ttttccagag ccatgaaggc agttagcgct ctagtttttt gttctttgat 29912 

tggcattgtt tttaatagta aaattaccag agttagcttt attaaacatg ttaatgtaac 29972 

tgaaggagat aacatcacac tagcaggtgt agaaggtgct caaaacacca cctggacaaa 30032 

ataccatcta ggatggagag atatttgcac ctggaatgta acttattatt gcataggagt 30092 

taatcttacc attgttaacg ctaaccaatc tcagaatggg ttaattaaag gacagagtgt 30152 

tagtgtgacc agtgatgggt actataccca gcatagtttt aactacaaca ttactgtcat 30212 

accactgcct acgcctagcc cacctagcac taccacacag acaaccacat acagtacatc 30272 

aaatcagcct accaccacta cagcagcaga ggttgccagc tcgtctgggg tccgagtggc 30332 

atttttgatg ttggccccat ctagcagtcc cactgctagt accaatgagc agactactga 30392 

atttttgtcc actgtcgaga gccacaccac agctacctcc agtgccttct ctagcaccgc 30452 

caatctctcc tcgctttcct ctacaccaat cagccccgct actactccta gccccgctcc 30512 

tcttcccact cccctgaagc aaacagacgg cggcatgcaa tggcagatca ccctgctcat 30572 

tgtgatcggg ttggtcatcc tggccgtgtt gctctactac atcttctgcc gccgcattcc 30632 

caacgcgcac cgcaagccgg cctacaagcc catcgttatc gggcagccgg agccgcttca 30692 
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aatctaaaaa 


atcttctctt 


ctcttttaca 


qtatgqtgat 


tgaactatga 


30752 


ttcctaaaca 


attcttaatc 


actattctta 


tctgcctcct 


ccaagtctgt 


gccaccctcg 


30812 


ctctaataac 


caaccrccact 


ccagactgta 


ttgggccctt 


cgcctcctac 


gtgctctttg 


30872 


ccttcatcac 


ctgcatctgc 


tactotaaca 


tagtctgect 


gcttatcacc 


ttcttccagt 


30932 


tcattaacta 


era t ctttata 


cgcatcgcct 


acctgcgcca 


ccacccccag 


taccgcgacc 


30992 


aacaaataac 

w%y wy w» *j ;J 


gcagctgctc 


aggctcctct 


gataagcatg 


cqaqctctac 


tacttctcgc 


31052 


acttctacta 


ttagtgctcc 


cccgtcccgt 


cgacccccgg 


tcccccactc 


agtcccccga 


31112 


eraaaattcac 
yy y ? s» wwy w 


aaatacaaat 


tccaaoaacc 


ctggaaattc 


etcaaatget 


accgccaaaa 


31172 


atcaaacato 


catcccaact 


aaatcataat 


cattaaaatc 


gtgaacattc 


tggcctgcac 


31232 


cctcatctcc 


tttataattt 


a cccct arc tt 


taactttaat 


tcrcra a ctcac 


caaaaacact 

wviy v\y y wy w w 


31292 


ct at ctccca 


cctaaaccta 


acacaccacc 


acacrcaacaa 


cctcaggcac 


acgcactacc 


31352 


accaccacag 


cctaggccac 


aatacatgee 


catattagac 


tatqaqqecq 


agccacagcg 


31412 


acccatactc 


ccccrctatt a 


attacttcaa 


tctaaccaac 


aaaaataact 

y y vt.y h wy caw w 


gacccactgg 


31472 


ccaataacaa 


cgtcaacgac 


cttctcctgg 


acatggaegg 


ccacacctca 


gagcagegae 


31532 


i» wy u^ucici^ u> 


fceacafct eat 


vt* y wwy vwy y 


a era era crc cat 
a y wy a y**y w 


caaaaaacta 

y y ^y y 


cagga eggea 


31592 


taoccatcca 

v* y w wci w w wu 


ccacftcicaaci 


aaaaacatct 


tctacctaat 


aaaacaaacc 

y aaawayy w w 


aaaatctcct 


31652 


a err a rr cf t ca c 

a wy ayy t. «— « w 




catccrcctct 

UUU WW M W Id 


cctacoaflct 

w w w u wy wi y w w 


cctacaacaa 


caccaaaaat 

wy wwwey wiMy w 


31712 


tcacctocct 


y y uvy y c*y i— w 


aaccccatcQ 

vl v» w w w wh v> w y 


tcatcaccca 


acaatcaaac 


gataccaagg 


31772 


aatacatcca 

y y v y *— w wc* 


etcrctcctac 


gactcccccg 


actgegt cca 


cactctgatc 


aagaccctct 


31832 


dccfcrcctcco 


ccfacctcctc 


cccatgaact 


aatcaccccc 


ttatccagtg 


aaataaagat 


31892 


catattaata 


ataatttaaa 


taaaaaaaat 

V* S* %A V% V* s* v** w 


aatcatttga 


tttgaaataa 


agatacaatc 


31952 




tttcranrttta 


acaaaaataa 


aoaatcactt 


acttgaaatc 


tgataccagg 


32012 


tctctgtcca 


tgttttctgc 


caacaccacc 


tcactcccct 


cttcccagct 


ctggtactgc 


32072 


aggccccggc 


gggctgcaaa 


cttcctccac 


aegctgaagg 


ggatgtcaaa 


ttcctcctgt 


32132 


ccctcaatct 


tcattttatc 


ttctatcag atg tec aaa 
Met Ser Lys 


aag cgc gtc egg gtg 
Lys Arg Val Arg Val 


32185 



1475 

gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32230 
Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala 
1480 1485 1490 
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ccg acc gtg ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga 32275 
Pro Thr Val Pro Phe lie Asn Pro Pro Phe Val Ser Ser Asp Gly 
1495 1500 1505 

ttc caa gag aag ccc ctg ggg gtg ttg tec ctg cga ctg get gac 32320 
Phe Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp 
1510 1515 1520 



ccc gtc acc acc aag aac ggg gaa ate acc etc 
Pro Val Thr Thr Lys Asn Gly Glu lie Thr Leu 
1525 153a 1535 



aag ctg gga gag 
Lys Leu Gly Glu 



ggg gtg gac etc gac teg 
Gly Val Asp Leu Asp Ser 
1540 * 1545 



teg gga aaa etc ate tec aac acg gec 
Ser Gly Lys Leu He Ser Asn Thr Ala 
1550 



32365 



32410 



acc aag gec gec gec cct etc agt att tea aac aac acc att tec 
Thr Lys Ala Ala Ala Pro Leu Ser He Ser Asn Asn Thr lie Ser 
1555 1560 1565 

ctt aaa act get gec cct • ttc tac aac aac aat gga act tta age 
Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser 
1570 1575 1580 

etc aat gtc tec aca cca tta gca gta ttt ccc aca ttt aac act 
Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 
1585 1590 1595 

tta ggc ata agt ctt gga aac ggt ctt cag act tea aat aag ttg 
Leu Gly He Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu 
1600 1605" 1610 

ttg act gta caa eta act cat cct ctt aca ttc age tea aat age 
Leu Thr Val Gin Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser 
1615 1620 1625 

ate aca gta aaa aca gac aaa ggg eta tat att aac tec agt gga 
He Thr Val Lys Thr Asp Lys Gly Leu Tyr He Asn Ser Ser Gly 
1630 1635 1640 

aac aga gga ctt gag get aat ata age eta aaa aga gga eta gtt 
Asn Arg Gly Leu Glu Ala Asn He Ser Leu Lys Arg Gly Leu Val 
1645 1650 165S 

ttt gac ggt aat get att gca aca tat att gga aat ggc tta gac 
Phe Asp Gly Asn Ala He Ala Thr Tyr He Gly Asn Gly Leu Asp 
1660 1665 1670 

tat gga tct tat gat agt gat gga aaa aca aga ccc gta att acc 
Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Val He Thr 
1675 1680 1685 

aaa att gga gca gga tta aat ttt gat get aac aaa gca ata get 
Lys He Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala He Ala 
1690 1695 1700 



32455 



32500 



32545 



32590 



32635 



32680 



32725 



32770 



32815 



32860 
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gtc aaa eta ggc aca ggt tta agt ttt gac tec get ggt gec ttg 
Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Ala Gly Ala Leu 
1705 1710 1715 



32905 



aca get gga aac aaa cag gat gac aag eta aca ctt tgg act acc 
Thr Ala Gly Asn Lys Gin Asp Asp Lys Leu Thr Leu Trp Thr Thr 
1720 1725 * ^ 1730 



32950 



cct gac cca age cct aat . tgt caa tta ctt tea gac aga gat gee 
Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser Asp Arg Asp Ala 
1735 1740 1745 



32995 



aaa ttt act etc tgt ctt aca aaa tgc ggt agt caa ata eta ggc 
Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin lie Leu Gly 
1750 1755 1760 



33040 



act gtg gca gtg gcg get gtt act gta gga tea gca eta aat cca 
Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro 
1765 177a 1775 



33085 



att aat gac aca gtc aaa age gee ata gtt ttc ctt aga ttt gat 
lie Asn Asp Thr Val Lys Ser Ala lie Val Phe Leu Arg Phe Asp 
1780 * 1785 1790 



33130 



tec gat ggt gta etc atg tea aac tea tea atg gta ggt gat tac 
Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr 
1795 1800 1805 



33175 



tgg aac ttt agg gag gga ' cag acc act caa agt gta gee tat aca 
Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr Thr 
1810 1815 1820 



33220 



aat get gtg gga ttc atg cca aat ata ggt gca tat cca aaa ace 
Asn Ala Val Gly Phe Met Pro Asn lie Gly Ala Tyr Pro Lys Thr 
1825 1830 1835 



33265 



caa agt aaa aca cct aaa aat age ata gtc agt cag gta tat tta 
Gin Ser Lys Thr Pro Lys . Asn Ser lie Val Ser Gin Val Tyr Leu 
1840 1845 1850 



33310 



act gga gaa act act atg cca atg aca eta acc 
Thr Gly Glu Thr Thr Met Pro Met Thr Leu Thr 
1855 1860 1865 



ata act ttc aat 
lie Thr Phe Asn 



33355 



ggc act gat gaa aaa gac aca acc cca gtt age acc tac tct atg 
Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met 
1870 1875 1880 



33400 



act ttt aca tgg cag tgg act gga gac tat aag gac aaa aat att 
Thr Phe Thr Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn lie 
1885 1890 1895 



33445 



acc ttt get acc aac tea ttc tct ttt tec tac ate gec cag gaa 
Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr lie Ala Gin Glu 
1900 1905 1910 



33490 



taa tcccacccag caagccaacc ccttttccca ccacctttgt ctatatggaa 



33543 
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actctgaaac 


agaaaaataa 


agttcaagtg 


ttttattgaa 


tcaacagttt 


tacaggactc 


33603 


gagcagttat 


ttttcctcca 


ccctcccagg 


acatggaata 


caccaccctc 


tccccccgca 


33663 


cagccttgaa 


catctgaatg 


ccattggtga 


tggacatgct 


tttggtctcc 


acgttccaca 


33723 


cagtttcaga 


gcgagccagt 


ctcggatcgg 


tcagggagat 


gaaaccctcc 


gggcactccc 


33763 


gcatctgcac 


ctcacagctc 


aacagctgag 


gattgtcctc 


ggtggtcggg 


atcacggtta 


33843 


tctggaagaa 


gcagaagagc 


ggcggtggga 


atcatagtcc 


gcgaacggga 


tcggccggtg 


33903 


gtgtcgcatc 


aggccccgca 


gcagtcgctg 


ccgccgccgc 


tccgtcaagc 


tgctgctcag 


33963 


ggggttcggg 


tccagggact 


ccctcagcat 


gatgcccacg 


gccctcagca 


tcagtcgtct 


34023 


ggtgcggcgg 


gcgcagcagc 


gcatgcgaat 


ctcgctcagg 


tcactgcagt 


acgtgcaaca 


34083 


caggaccacc 


aggttgttca 


acagtccata 


gttcaacacg 


ctccagccga 


aactcatcgc 


34143 


gggaaggatg 


ctacccacgt 


ggccgtcgta 


ccagatcctc 


aggtaaatca 


agtggcgctc 


34203 


cctccagaag 


acgctgccca 


tgtacatgat 


ctccttgggc 


atgtggcggt 


tcaccacctc 


34263 


ccggtaccac 


atcaccctct 


ggttgaacat 


gcagccccgg 


atgatcctgc 


ggaaccacag 


34323 


ggccagcacc 


gccccgcccg 


ccatgcagcg 


aagagacccc 


ggatcccggc 


aatgacaatg 


34383 


gaggacccac 


cgctcgtacc 


cgtggatcat 


ctgggagctg 


aacaagtcta 


tgttggcaca 


34443 


gcacaggcat 


atgctcatgc 


atctcttcag 


cactctcagc 


tcctcggggg 


tcaaaaccat 


34503 


atcccagggc 


acggggaact 


cttgcaggac 


agcgaacccc 


gcagaacagg 


gcaatcctcg 


34563 


cacataactt 


acattgtgca 


tggacagggt 


atcgcaatca 


ggcagcaccg 


ggtgatcctc 


34623 


caccagagaa 


gcgcgggtct 


cggtctcctc 


acagcgtggt 


aagggggccg 


gccgatacgg 


34683 


gtgatggcgg 


gacgcggctg 


atcgtgttct 


cgaccgtgtc 


atgatgcagt 


tgctttcgga 


34743 


cattttcgta 


cttgctgtag 


cagaacctgg' 


tccgggcgct 


gcacaccgat 


cgccggcggc 


34803 


ggtctcggcg 


cttggaacgc 


tcggtgttaa 


agttgtaaaa 


cagccactct 


ctcagaccgt 


34863 


gcagcagatc 


tagggcctca 


ggagtgatga 


agatcccatc 


atgcctgata 


gctctgatca 


34923 


catcgaccac 


cgtggaatgg 


gccaggccca 


gccagatgat 


gcaattttgt 


tgggtttcgg 


34983 


tgacggcggg 

^3 33 333 


ggagggaaga 


acaggaagaa 


ccatgattaa 


cttttaatcc 


aaacggtctc 


35043 


ggagcacttc 


aaaatgaagg 


tcacggagat 


ggcacctctc 


gcccccgctg 


tgttggtgga 


35103 


aaataacagc 


caggtcaaag 


gtgatacggt 


tctcgagatg 


ttccacggtg 


gcttccagca 


35163 


aagcctccac 


gcgcacatcc 


agaaacaaga 


caatagcgaa 


agcgggaggg 


ttctctaatt 


35223 
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cctcaaccat catgttacac tcctgcacca tccccagata attttcattt ttccagcctt 35283 

gaatgattcg aactagttcc tgaggtaaat ccaagccagc catgataaaa agctcgcgca 35343 

gagcaccctc caccggcatt cttaagcaca ccctcataat tccaagatat tctgctcctg 35403 

gttcacctgc agcagattga caagcggaat atcaaaatct ctgccgcgat ccctgagctc 35463 

ctccctcagc aataactgta agtactcttt catatcgtct ccgaaatttt tagccatagg 35523 

acccccagga ataagagaag ggcaagccac attacagata aaccgaagtc ccccccagtg 35583 

agcattgcca aatgtaagat tgaaataagc atgctggcta gacccggtga tatcttccag 35643 

ataactggac agaaaatcgg gtaagcaatt tttaagaaaa tcaacaaaag aaaaatcttc 35703 

caggtgcacg tttagggcct cgggaacaac gatggagtaa gtgcaagggg tgcgttccag 35763 

catggttagt tagctgatct gtaaaaaaac aaaaaataaa acattaaacc atgctagcct 35823 

ggcgaacagg tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc 35883 

gaccctcgta aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc 35943 

cggcgtgaat gattcgagaa gaagcataca cccccggaac attggagtcc gtgagtgaaa 36003 

aaaagcggcc gaggaagcaa tgaggcacta caacgctcac tctcaagtcc agcaaagcga 36063 

tgccatgcgg atgaagcaca aaattttcag gtgcgtaaaa aatgtaatta ctcccctcct 36123 

gcacaggcag cgaagctccc gatccctcca gatacacata caaagcctca gcgtccatag 36183 

cttaccgagc ggcagcagca gcggcacaca acaggcgcaa gagtcagaga aaagactgag 36243 

ctctaacctg tccgcccgct ctctgctcaa tatatagccc cagatctaca ctgacgtaaa 36303 

ggccaaagtc taaaaatacc cgccaaataa tcacacacgc ccagcacacg cccagaaacc 36363 

ggtgacacac tcagaaaaat acgcgcactt cctcaaacgg ccaaactgcc gtcatttccg 36423 

ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt taaaaacatc 36483 

acccgccccg cccctaacgg tcgccgctcc cgcagccaat caccttcctc cctccccaaa 36543 

ttcaaacagc tcatttgcat attaacgcgc accaaaagtt tgaggtatat tattgatgat 36603 

36604 



<210> 6 

<211> 529 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
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<400> 6 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
15 10 15 

Val Met Gin Gin Ala Val Al5> Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
50 55 60 

Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn 
65 * 70 75 80 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr lie Asn Phe Asp Glu Arg Ser Arg 
100 105 110 

Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn Val 
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 ~ 135 " 140 

Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr Asp Gly Ser Gin Asp 
145 150 155 160 

Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe 
165 170 175 

Ser Val Thr Met Thr He Asp Leu Met Asn Asn Ala He He Asp Asn 
180 185 190 

Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly 
195 200 205 

Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 
210 215 220 

Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 
225 230 235 240 

He Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu 
245 250 255 

Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe 
260 265 270 



Gin He Leu Tyr Glu Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu 
275 280 285 
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Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp Ser Thr Ala Ala Ala 
290 295 300 

Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe 
305 310 315 320 

Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys lie 
325 330 335 

Val He Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val 
340 345 350 

Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr 
355 360 365 

Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr 
370 375 380 

Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser Leu Pro 
385 390 395 400 

Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin Val Ser 
405 410 415 

Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser 
420 425 430 

Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala Phe Thr 
435 440 445 

Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin lie Leu Val 
450 455 460 

Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro Ala 
465 470 475 480 

Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Arg Gly Val 
485 490 495 

Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val 
500 505 510 

Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr 
515 520 525 

Phe 



<210> 7 
<211> 942 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
<400> 7 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His lie Ala 
15 10 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 * 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr Gly Asn Gly Gly 
130 135 140 

Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn 
145 150 155 160 

lie Thr Lys Asp Gly Leu Gin lie Gly Thr Asp Val Thr Ala Asn Gin 
165 170 175 

Asn Lys Pro lie Tyr Ala Asp Lys Thr Phe Gin Pro Glu Pro Gin Val 
180 185 190 

Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala 
195 200 205 

Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg 
210 215 220 

Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys Val Gly Asp Asp 
225 230 235 240 

Gly Val Pro Thr Lys Glu Phe Asp lie Asp Leu Ala Phe Phe Asp Thr 
245 250 255 

Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys Ala Asp lie Val 
260 265 270 

Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val 
275 280 285 

Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu lie Asn Leu Val Gin 
290 295 300 

Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe 
305 310 315 320 
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lie Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala 
325 330 335 

Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn 
340 345 350 

Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr 
355 360 365 

Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp 
370 375 380 

Val Arg He He Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr 
385 ' 390 395 400 

Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gin Gly Val 
405 410 415 

Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser Glu Trp Glu Asn 
420 " 425 430 

Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys Gly Asn He Phe 
435 440 445 

Ala Met Glu He Asn Leu Gin Ala Asn Leu Trp Arg Ser Phe Leu Tyr 
450 455 460 

Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr 
465 470 475 480 

Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly 
485 490 495 

Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Leu Asn He Gly Ala 
500 505 510 

Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His 
515 520 525 

Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg 
530 535 540 

Tyr Val Pro Phe His He Gin Val Pro Gin Lys Phe Phe Ala He Lys 
545 550 555 560 

Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg 
565 570 575 

Lys Asp Val Asn Met He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg 
580 585 590 

Thr Asp Gly Ala Ser He Ala Phe Thr Ser He Asn Leu Tyr Ala Thr 
595 600 605 

Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 
610 615 620 
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Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Ser Ala Ala 
625 630 635 640 

Asn Met Leu Tyr Pro He Pro Ala Asn Ala Thr Asn Val Pro He Ser 
645 650 655 

lie Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg 
660 665 " 670 

Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr 
675 680 685 

Phe Val Tyr Ser Gly Ser He Pro Tyr Leu Asp Gly Thr Phe Tyr Leu 
690 695 700 

Asn His Thr Phe Lys Lys Val Ser He Thr Phe Asp Ser Ser Val Ser 
05 710 • 715 720 

Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys 
725 730 735 

Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 
740 745 750 

Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn He Gly Tyr 
755 760 765 

Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe 
770 775 780 

Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp Glu Val Asn 
785 790 795 800 

Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin His Asn Asn Ser 
805 810 815 

Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gin Gly Gin Pro Tyr 
8 20 825 830 

Pro Ala Asn Tyr Pro Tyr Pro Leu He Gly Lys Ser Ala Val Ala Ser 
835 840 845 

Val Thr Gin Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg He Pro 
850 855 860 

Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gin 

005 es->n 



870 



875 



880 



Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu 
885 890 895 

Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val 
900 905 ' 910 

Phe Asp Val Val Arg Val His Gin Pro His Arg Gly Val He Glu Ala 
915 920 925 
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Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
930 935 940 



<210> 8 

<211> 443 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 

<400> 8 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
.1 5 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe lie Asn Pro Pro 
20 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu lie Thr Leu 
50 55 60 

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu lie Ser 
65 70 75 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser lie Ser Asn Asn Thr 
85 90 95 

lie Ser Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu 
100 105 110 

Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr 
115 120 125 

Leu Gly lie Ser Leu Gly Asn Gly Leu Gin Thr Ser Asn Lys Leu Leu 
130 135 140 

Thr Val Gin Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser lie Thr 
145 150 155 160 

Val Lys Thr Asp Lys Gly Leu Tyr He Asn Ser Ser Gly Asn Arg Gly 
165 170 175 

Leu Glu Ala Asn He Ser Leu Lys Arg Gly Leu Val Phe Asp Gly Asn 
180 185 190 

Ala He Ala Thr Tyr He Gly Asn Gly Leu Asp Tyr Gly Ser Tyr Asp 
195 200 205 

Ser Asp Gly Lys Thr Arg Pro Val He Thr Lys He Gly Ala Gly Leu 
210 215 220 

Asn Phe Asp Ala Asn Lys Ala He Ala Val Lys Leu Gly Thr Gly Leu 
225 230 235 * 240 
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Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Gin Asp Asp 
245 ' 250 255 

Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu 
260 265 270 

Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly 
275 280 285 

Ser Gin He Leu Gly Thr Val Ala Val Ala Ala Val Thr Val' Gly Ser 
290 295 300 

Ala Leu Asn Pro He Asn Asp Thr Val Lys Ser Ala He Val Phe Leu 
305 310 315 320 

Arg Phe Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly 
325 330 335 

Asp Tyr Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr 
340 345 350 

Thr Asn Ala Val Gly Phe Met Pro Asn He Gly Ala Tyr Pro Lys Thr 
355 360 365 

Gin Ser Lys Thr Pro Lys Asn Ser He Val Ser Gin Val Tyr Leu Thr 
370 ~ 375 380 

Gly Glu Thr Thr Met Pro Met Thr Leu Thr He Thr Phe Asn Gly Thr 
385 390 395 400 

Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr 
405 410 415 

Trp Gin Trp Thr Gly Asp Tyr Lys Asp Lys Asn He Thr Phe Ala Thr 
420 425 430 

Asn Ser Phe Ser Phe Ser Tyr He Ala Gin Glu 
435 440 



<210> 9 

<211> 36535 1 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan7 

<220> 

<221> CDS 

<222> (13874) . . (15469) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18288) . . (21086) 

<223> L3 Hexon 
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<220> 

<221> CDS 

<222> (32094) (33425) 

<223> L5 Fiber 

<400> 9 



catcatcaat 


aatatacctc 


aaacttttgg tgcgcgttaa tatgcaaatg 


agctgtttga 


60 


atttggggag 


ggaggaaggt 


gattggccga gagacgggcg accgttaggg 


gcggggcggg 


120 


tgacgttttt 


aatacgtggc 


cgtgaggcgg agccggtttg caagttctcg 


tgggaaaagt 


180 


gacgtcaaac 


gaggtgtggt 


ttgaacacgg aaatactcaa ttttcccgcg 


ctctctgaca 


240 


ggaaatgagg 


tgtttctggg 


cggatgcaag tgaaaacggg ccattttcgc 


gcgaaaactg 


300 


aatgaggaag 


tgaaaatctg 


agtaatttcg cgtttatggc agggaggagt 


atttgccgag 


360 


ggccgagtag 


actttgaccg 


attacgtggg ggtttcgatt accgtatttt 


tcacctaaat 


420 


ttccgcgtac 


ggtgtcaaag 


tccggtgttt ttacgtaggc gtcagctgat 


cgccagggta 


480 


tttaaacctg 


cgctctctag 


tcaagaggcc actcttgagt gccagcgagt 


agagttttct 


540 


cctccgcgcc 


gcgagtcaga 


tctacacttt gaaagatgag gcacctgaga 


gacctgcccg 


600 


gtaatgtttt 


cctggctact 


gggaacgaga ttctggaatt ggtggtggac 


gccatgatgg 


660 


gtggcgaccc 


tcctgagccc 


cctaccccat ttgaggcgcc ttcgctgtac 


gatttgtatg 


720 


atctggaggt 


ggatgtgccc 


gagaacgacc ccaacgagga ggcggtgaat 


gatttgttta 


780 


gcgatgccgc 


gctgctggct 


gccgagcagg ctaatacgga ctctggctca 


gacagcgatt 


840 


cctctctcca 


taccccgaga 


cccggcagag gtgagaaaaa gatccccgag 


cttaaagggg 


900 


aagagctcga 


cctgcgctgc 


tatgaggaat gcttgcctcc gagcgatgat 


gaggaggacg 


960 


aggaggcgat 


tcgagctgca 


tcgaaccagg gagtgaaagc tgcgggcgaa 


agctttagcc 


1020 


tggactgtcc 


tactctgccc 


ggacacggct gtaagtcttg tgaatttcat 


cgcatgaata 


1080 


ctggagataa 


gaatgtgatg 


tgtgccctgt gctatatgag agcttacaac 


cattgtgttt 


1140 


acagtaagtg 


tgattaactt 


tagttgggaa ggcagagggt gactgggtgc 


tgactggttt 


1200 


atttatgtat 


atgttttttt 


atgtgtaggt cccgtctctg acgtagatga 


gacccccact 


1260 


tcagagtgca 


tttcatcacc 


cccagaaatt ggcgaggaac cgcccgaaga 


tattattcat 


1320 


agaccagttg 


cagtgagagt 


caccgggcgg agagcagctg tggagagttt 


ggatgacttg 


1380 


ctacagggtg 


gggatgaacc 


tttggacttg tgtacccgga aacgccccag 


gcactaagtg 


1440 


ccacacatgt 


gtgtttactt 


aaggtgatgt cagtatttat agggtgtgga 


gtgcaataaa 


1500 
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1800 
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ctcactttaa 
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w^ w w w w wy y w 
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ettttttgee 
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atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca gggtcccgtg 8400 

cccctggggc gccaccaccg tgccccgttt cttcttgggc gctgcttcca tgccggtcag 8460 

aagcggcggc gaggacgcgc gccgggcggc aggggcggct cgggacccgg aggcaggggc 8520 

ggcaggggca cgtcggcgcc gcgcgcgggc aggttctggt actgcgcccg gagaagactg 8580 

gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 

ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt atcgttgacg 8700 

gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc gatctcggtc 8760 

atgaactgct cgatctcctc ctcctgaagg tctccgcggc cggcgcgctc gacggtggcc 8820 

gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc ggcctcgttc 8880 

cagacgcggc tgtagaccac ggctccgtcg gggtcgcgcg cgcgcatgac cacctgggcg 8940 

aggttgagct cgacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 

ttgagcgtgg tggcgatgtg ctcggtgacg aagaagtaca tgatccagcg gcggagcggc 9060 

atctcgctga cgtcgcccag ggcttccaag cgctccatgg cctcgtagaa gtccacggcg 9120 

aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 

agctcagcga tggtggcgcg cacctcgcgc tcgaaggccc cggggggctc ctcttcttcc 9240 

atctcttcct cctccactaa catctcttct acttcctcct caggaggcgg cggcggggga 9300 

ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 9360 

ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 9420 

aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 9480 

gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540 

tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 9600 

ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660 

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct ggggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtatagga gcagttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 
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gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggtgc gcacgaggta ctggtatccg 10140 

acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 

ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260 

atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 

agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380 

atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440 

gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 

ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 

ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 

ggaaagcgac cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 

cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 

cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 

ggagcgagcc cctctttttc ttgtgttttt gccagatgca tcccgtactg cggcagatgc 10860 

gcccccaccc tccacctcaa ccgcccctac cgccgcagca gcagcaacag ccggcgcttc 10920 

tgcccccgcc ccagcagcag ccagccacta ccgcggcggc cgccgtgagc ggagccggcg 10980 

ttcagtatga cctggccttg gaagagggcg aggggctggc gcggctgggg gcgtcgtcgc 11040 

cggagcggca cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc 11100 

agaacctgtt cagagacagg agcggcgagg agcccgagga gatgcgcgcc tcccgcttcc 11160 

acgcggggcg ggagctgcgg cgcggcctgg accgaaagcg ggtgctgagg gacgaggatt 11220 

tcgaggcgga cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc 11280 

tggtcacggc gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca 11340 

accacgtgcg cacgctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg 11400 

acctgctgga ggccatcgtg cagaacccca cgagcaagcc gctgacggcg cagctgtttc 11460 

tggtggtgca gcacagtcgg gacaacgaga cgttcaggga ggcgctgctg aatatcaccg 11520 

agcccgaggg ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg 11580 

agcgcgggct gccgctgtcc gagaagctgg cggctatcaa cttctcggtg ctgagcctgg 11640 

gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700 

agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 11760 
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gggtgtaccg caacgacagg 


atgcaccgcg 


cggtgagcgc 


cagccgccgg 


cgcgagctga 


11820 


gcgaccagga gctgatgcac 


agcctgcagc 


gggccctgac 


cggggccggg 


accgaggggg 


11880 


agagctactt tgacatgggc 


gcggacctgc 


gctggcagcc 


cagccgccgg 


gccttggaag 


11940 


ctgccggcgg ttccccctac 


gtggaggagg 


tggacgatga 


ggaggaggag 


ggcgagtacc 


12000 


tggaagactg atggcgcgac 


cgtatttttg 


ctagatgcag 


caacagccac 


cgcctcctga 


12060 


tcccgcgatg cgggcggcgc 


tgcagagcca 


gccgtccggc 


attaactcct 


cggacgattg 


12120 


gacccaggcc atgcaacgca 


tcatggcgct 


gacgacccgc 


aatcccgaag 


cctttagaca 


12180 


gcagcctcag gccaaccggc 


tctcggccat 


cctggaggcc 


gtggtgccct 


cgcgctcgaa 


12240 


ccccacgcac gagaaggtgc 


tggccatcgt 


gaacgcgctg 


gtggagaaca 


aggccatccg 


12300 


cggcgacgag gccgggctgg 


tgtacaacgc 


gctgctggag 


cgcgtggccc 


gctacaacag 


12360 


caccaacgtg cagacgaacc 


tggaccgcat 


ggtgaccgac 


gtgcgcgagg 


cggtgtcgca 


12420 


gcgcgagcgg ttccaccgcg 


agtcgaacct 


gggctccatg 


gtggcgctga 


acgccttcct 


12480 


gagcacgcag cccgccaacg 


tgccccgggg 


ccaggaggac 


tacaccaact 


tcatcagcgc 


12540 


gctgcggctg atggtggccg 


aggtgcccca 


gagcgaggtg 


taccagtcgg 


ggccggacta 


12600 


cttcttccag accagtcgcc 


agggcttgca 


gaccgtgaac 


ctgagccagg 


ctttcaagaa 


12660 


cttgcaggga ctgtggggcg 


tgcaggcccc 


ggtcggggac 


cgcgcgacgg 


tgtcgagcct 


12720 


gctgacgccg aactcgcgcc 


tgctgctgct 


gctggtggcg 


cccttcacgg 


acagcggcag 


12780 


cgtgagccgc gactcgtacc 


tgggctacct 


gcttaacctg 


taccgcgagg 


ccatcgggca 


12840 


ggcgcacgtg gacgagcaga 


cctaccagga 


gatcacccac 


gtgagccgcg 


cgctgggcca 


12900 


ggaggacccg ggcaacctgg 


aggccaccct 


gaacttcctg 


ctgaccaacc 


ggtcgcagaa 


12960 


gatcccgccc cagtacgcgc 


tgagcaccga 


ggaggagcgc 


atcctgcgct 


acgtgcagca 


13020 


gagcgtgggg ctgttcctga 


tgcaggaggg 


ggccacgccc 


agcgccgcgc 


tcgacatgac 


13080 


cgcgcgcaac atggagccca 


gcatgtacgc 


tcgcaaccgc 


ccgttcatca 


ataagctgat 


13140 


ggactacttg catcgggcgg 


ccgccatgaa 


ctcggactac 


tttaccaacg 


ccatcttgaa 


13200 


cccgcactgg ctcccgccgc 


ccgggttcta 


cacgggcgag 


tacgacatgc 


ccgaccccaa 


13260 


cgacgggttc ctgtgggacg 


acgtggacag 


cagcgtgttc 


tcgccgcgcc 


ccgccaccac 


13320 




gggaccggcg 




gcgcugrccg 


gtcgcgcggg 




tgctgccgcg gcggtgcctg 


aggccgccag 


ccccttcccg 


agcctgccct 


tttcgctgaa 


13440 


cagcgtgcgc agcagcgagc 


tgggtcggct 


gacgcggccg 


cgcctgctgg 


gcgaggagga 


13500 
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gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 

agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc acagggacga 

gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 

tctggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact tgggtgggag 

tggtggtggt aacccgttcg ctcacttgcg cccccgtatc gggcgcctga tgtaagaatc 

tgaaaaaata aaaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 

tgtttgtagt agt atg atg agg cgc gtg tac ccg gag ggt cct cct ccc 
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro 
15 10 

teg tac gag age gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc 
Ser Tyr Glu Ser Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro 
15 20 25 

ccg ctg gag gcg cct tac gtg ccc ccg egg tac ctg gcg cct acg gag 
Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu 
30 35 40 

ggg egg aac age att cgt tac teg gag ctg gca ccc ttg tac gat acc 
Gly Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr 
45 50 55 60 

acc egg ttg tac ctg gtg gac aac aag teg gcg gac ate gec teg ctg 
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu 
65 70 75 

aac tac cag aac gac cac age aac ttc ctg acc acc gtg gtg cag aac 
Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn 
80 85 90 

aac gat ttc acc ccc acg gag gec age acc cag acc ate aac ttt gac 
Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp 
95 100 105 

gag cgc teg egg tgg ggc ggc cag ctg aaa acc ate atg cac acc aac 
Glu Arg Ser Arg Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn 
110 * " 115 120 

atg ccc aac gtg aac gag ttc atg tac age aac aag ttc aag gcg egg 
Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg 
125 130 135 140 

gtg atg gtc teg cgc aag acc ccc aat ggg gtc gcg gtg gat gag aat 
Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn 
145 150 155 

tat gat ggt agt cag gac gag ctg act tac gag tgg gtg gag ttt gag 
Tyr Asp Gly Ser Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu 
160 165 170 



13560 
13620 
13680 
13740 
13800 
13860 
13909 

13957 

14005 

14053 

14101 

14149 

14197 

14245 

14293 

14341 

14389 
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ctg ccc gag ggc aac ttc teg gtg acc atg acc ate gat ctg atg aac 14437 
Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr lie Asp Leu Met Asn 
175 180 185 

aac gec ate ate gac aac tac ttg gcg gtg ggg cgt cag aac ggg gtg 14485 
Asn Ala lie lie Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val 
190 195 200 

ctg gag age gac ate ggc gtg aag ttc gac acg cgc aac ttc egg ctg 14533 
Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu 
205 210 215 220 

ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac 14581 
Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn 
225 230 235 

gag gec ttc cac ccc gac ate gtc ctg ctg ccc ggc tgc ggc gtg gac 14629 
Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp 
240 245 250 

ttc acc gag age cgc etc age aac ctg ctg ggc ate cgc aag egg cag 14677 
Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin 
255 260 265 

ccc ttc cag gag ggc ttc cag ate ctg tac gag gac ctg gag ggg ggc 14725 
Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu Gly Gly 
270 275 280 

aac ate ccc gcg etc ttg gat gtc gaa gee tat gag aaa age aag gag 14773 
Asn He Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu 
285 290 295 300 

gag gec gec gca gcg gcg acc gca gec gtg gec acc gec tct acc gag 14821 
Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu 
305 310 315 

gtg egg ggc gat aat ttt get age gec gcg gca gtg gec gag gcg get 14869 
Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala 
320 325 330 

gaa acc gaa agt aag ata gtc ate cag ccg gtg gag aag gac age aag 14917 
Glu Thr Glu Ser Lys lie Val He Gin Pro Val Glu Lys Asp Ser Lys 
335 340 345 

gac agg age tac aac gtg etc gcg gac aag aaa aac acc gec tac cgc 14965 
Asp Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg 
350 355 360 

age tgg tac ctg gec tac aac tac ggc gac ccc gag aag ggc gtg cgc 15013 
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
365 370 375 380 

tec tgg acg ctg etc acc acc teg gac gtc acc tgc ggc gtg gag caa 15061 
Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin 
385 390 395 
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gtc tac tgg teg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15109 
Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 
400 405 410 

tec acg cgt caa gtt age aac tac ccg gtg gtg ggc gec gag etc ctg 15157 
Ser Thr Arg Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu 
415 420 425 

ccc gtc tac tec aag age ttc ttc aac gag cag gec gtc tac teg cag 15205 
Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin 
430 435 440 

cag ctg cgc gee ttc acc teg etc acg cac gtc ttc aac cgc ttc ccc 15253 
Gin Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro 
445 450 455 " 460 

gag aac cag ate etc gtc cgc ccg ccc gcg ccc acc att acc acc gtc 15301 
Glu Asn Gin lie Leu Val Arg Pro Pro Ala Pro Thr lie Thr Thr Val 
465 470 475 

agt gaa aac gtt cct get etc aca gat cac ggg acc ctg ccg ctg cgc 15349 
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 
480 485 490 

age agt ate egg gga gtc cag cgc gtg acc gtc act gac gec aga cgc 15397 
Ser Ser lie Arg Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg 
495 500 505 

cgc acc tgc ccc tac gtc tac aag gee ctg ggc gta gtc gcg ccg cgc 15445 
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg 
510 515 520 



gtc etc teg age cgc acc ttc taa aaaatgtcca ttctcatctc geccagtaat 
Val Leu Ser Ser Arg Thr Phe 
525 530 


15499 


aacaceggtt 


ggggcctgcg 


cgcgcccagc 


aagatgtacg 


gaggegcteg 


ccaacgctcc 


15559 


acgcaacacc 


ccgtgcgcgt 


gcgcgggcac 


ttccgcgctc 


cctggggcgc 


cctcaagggc 


15619 


cgcgtgcgct 


cgcgcaccac 


cgtcgacgac 


gtgatcgacc 


aggtggtggc 


cgacgcgcgc 


15679 


aactacacgc 


ccgccgccgc 


gcccgcctcc 


accgtggacg 


ccgtcatcga 


cagcgtggtg 


15739 


gccgatgcgc 


gccggtacgc 


ccgcgccaag 


agccggcggc 


ggcgcatcgc 


ccggcggcac 


15799 


cggagcaccc 


ccgccatgcg 


cgcggcgcga 


gccttgctgc 


geagggecag 


gcgcacggga 


15859 


cgcagggcca 


tgetcaggge 


ggccagacgc 


gcggcctccg 


gcagcagcag 


cgccggcagg 


15919 


acccgcagac 


gcgcggccac 


ggcggcggcg 


gcggccatcg 


ccagcatgtc 


ccgcccgcgg 


15979 


cgcggcaacg 


tgtactgggt 


gcgcgacgcc 


gccaccggtg 


tgcgcgtgcc 


cgtgcgcacc 


16039 


cgcccccctc 


gcacttgaag 


atgetgaett 


cgcgatgttg 


atgtgtccca 


gcggcgagga 


16099 


ggatgtccaa 


gegcaaatae 


aaggaagaga 


tgctccaggt 


catcgcgcct 


gagatctacg 


16159 
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gccccgcggt 


gaaggaggaa 


agaaagcccc 


gcaaactgaa 


gcgggtcaaa 


aaggacaaaa 


16219 


aggaggagga 


agatgtggac 


ggactggtgg 


agtttgtgcg 


cgagttcgcc 


ccqcggcggc 


16279 


gcgtgcagtg 


gcgcgggcgg 


aaagtgaaac 


cggtgctgcg 


gcccggcacc 


acggtggtct 


16339 


tcacgcccgg 


cgagcgttcc 


ggctccgcct 


ccaagcgctc 


ctacgacgag 


gtgtacgggg 


16399 


acgaggacat 


cctcgagcag 


gcggtcgagc 


gtctgggcga 


gtttgcttac 


ggcaagcgca 


16459 


gccgccccgc 


gcccttgaaa 


gaggaggcgg 


tgtccatccc 


gctggaccac 


ggcaacccca 


16519 


cgccgagcct 


gaagccggtg 


accctgcagc 


aggtgctgcc 


gagcgcggcg 


ccgcgccggg 


16579 


gcttcaagcg 


cgagggcggc 


gaggatctgt 


acccgaccat 


gcagctgatg 


gtgcccaagc 


16639 


gccagaagct 


ggaggacgtg 


ctggagcaca 


tgaaggtgga 


ccccgaggtg 


cagcccgagg 


16699 


tcaaggtgcg 


gcccatcaag 


caggtggccc 


cgggcctggg 


cgtgcagacc 


gtggacatca 


16759 


agatccccac 


ggagcccatg 


gaaacgcaga 


ccgagcccgt 


gaagcccagc 


accagcacca 


16819 


tggaggtgca 


gacggatccc 


tggatgccgg 


cgccggcttc 


caccactcgc 


cgaagacgca 


16879 


agtacggcgc 


ggccagcctg 


ctgatgccca 


actacgcgct 


gcatccttcc 


atcatcccca 


16939 


cgccgggcta 


ccgcggcacg 


cgcttctacc 


gcggctacac 


cagcagccgc 


cgcaagacca 


16999 


ccacccgccg 


ccgccgtcgt 


cgcacccgcc 


gcagcagcac 


cgcgacttcc 


gccgccgccc 


17059 


tggtgcggag 


agtgtaccgc 


agcgggcgcg 


agcctctgac 


cctgccgcgc 


gcgcgctacc 


17119 


acccgagcat 


cgccatttaa 


ctctgccgtc 


gcctcctact 


tgcagatatg 


gccctcacat 


17179 


gccgcctccg 


cgtccccatt 


acgggctacc 


gaggaagaaa 


gccgcgccgt 


agaaggctga 


17239 


cggggaacgg 


gctgcgtcgc 


catcaccacc 


ggcggcggcg 


cgccatcagc 


aagcggttgg 


17299 


ggggaggctt 


cctgcccgcg 


ctgatcccca 


tcatcgccgc 


ggcgatcggg 


gcgatccccg 


17359 


gcatagcttc 


cgtggcggtg 


caggcctctc 


agcgccactg 


agacacagct 


tggaaaattt 


17419 


gtaataaaaa 


aatggactga 


cgctcctggt 


cctgtgatgt 


gtgtttttag 


atggaagaca 


17479 


tcaatttttc 


gtccctggca 


ccgcgacacg 


gcacgcggcc 


gtttatgggc 


acctggagcg 


17539 


acatcggcaa 


cagccaactg 


aacgggggcg 


ccttcaattg 


gagcagtctc 


tggagcgggc 


17599 


ttaagaattt 


cgggtccacg 


ctcaaaacct 


atggcaacaa 


ggcgtggaac 


agcagcacag 


17659 


ggcaggcgct 


gagggaaaag 


ctgaaagagc 


agaacttcca 


gcagaaggtg 


gtcgatggcc 


17719 


tggcctcggg 


catcaacggg 


gtggtggacc 


tggccaacca 


ggccgtgcag 


aaacagatca 


17779 


acagccgcct 


ggacgcggtc 


ccgcccgcgg 


ggtccgtgga 


gatgccccag 


gtggaggagg 


17839 
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agctgcctcc 


cctggacaag 


cgcggcgaca 


agcgaccgcg 


tcccgacgcg 


gaggagaege 


17899 


tgctgacgca 


cacggacgag 


ccgcccccgt 


acgaggaggc 


ggtgaaactg 


ggtctgccca 


17959 


ccacgcggcc 


cgtggcgcct 


ctggccaccg 


gggtgctgaa 


acccagcagc 


agcagccagc 


18019 


ccgcgaccct 


ggacttgcct 


ccgcctgctt 


cccgcccctc 


cacagtggct 


aagcccctgc 


18079 


cgccggtggc 


cgtcgcgtcg 


cgcgcccccc 


gaggccgccc 


ccaggcgaac 


tggcagagca 


18139 


ctctgaacag 


catcgtgggt 


ctgggagtgc 


agagtgtgaa 


gcgccgccgc 


tgctattaaa 


18199 


agacactgta 


gcgcttaact 


tgcttgtctg 


tgtgtatatg 


tatgtccgcc 


gaccagaagg 


18259 


aggaagaggc 


gcgtcgccga 


gttgcaag atg gcc acc • 
Met Ala Thr : 


cca teg atg 
Pro Ser Met 


ctg ccc 
Leu Pro 


18311 



535 

cag tgg gcg tac atg cac ate gcc gga cag gac get teg gag tac ctg 18359 

Gin Trp Ala Tyr Met His He Ala Gly Gin Asp Ala Ser Glu Tyr Leu 
540 " 545 550 555 



agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 
Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 
560 565 570 



gca cct gtg caa ggc att age att aca aag gat ggt att caa ctt gga 
Ala Pro Val Gin Gly He Ser He Thr Lys Asp Gly He Gin Leu Gly 
685 " 690 695 



18407 



ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg 18455 
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 
575 580 585 

acc acc gac cgc age cag egg ctg acg ctg cgc ttc gtg ccc gtg gac 18503 
Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 
590 595 600 

cgc gag gac aac acc tac teg tac aaa gtg cgc tac acg ctg gcc gtg 18551 
Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 
605 610 615 

ggc gac aac cgc gtg ctg gac atg gcc age acc tac ttt gac ate cgc 18599 
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp He Arg 
620 625 630 635 

ggc gtg ctg gat egg ggg ccc age ttc aaa ccc tac tec ggc ace gcc 18647 
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 
640 645 650 

tac aac age ctg get ccc aag gga gcg ccc aac act tgc cag tgg aca 18695 
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr 
655 660 665 

tat aaa get ggt gat act gat aca gaa aaa acc tat aca tat gga aat 18743 
Tyr Lys Ala Gly Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn 
670 ' 675 680 



18791 
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act gac age gat ggt cag gca ate tat gca gac gaa act tat caa cca 18839 
Thr Asp Ser Asp Gly Gin Ala lie Tyr Ala Asp Glu Thr Tyr Gin Pro 
700 " 705 710 715 

gag cct caa gtg ggt gat get gaa tgg cat gac ate act ggt act gat 18887 
Glu Pro Gin Val Gly Asp Ala Glu Trp His Asp lie Thr Gly Thr Asp 
720 725 730 

gaa aaa tat gga ggc aga get ctt aag cct gac acc aaa atg aag cct 18935 
Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro 
735 740 745 

tgc tat ggt tct ttt gec aag cct acc aat aaa gaa gga ggc cag gca 18983 
Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala 
750 755 760 

aat gtg aaa acc gaa aca ggc ggt acc aaa gaa tat gac att gac atg 19031 
Asn Val Lys Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp lie Asp Met 
765 " 770 775 

gca ttc ttc gat aat cga agt gca get gec gee ggc eta gee cca gaa 19079 
Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu 
780 785 790 795 

att gtt ttg tat act gag aat gtg gat ctg gaa act cca gat acc cat 19127 
lie Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His 
800 805 810 

att gta tac aag gca ggt aca gat gac agt age tct tct ate aat ttg 19175 
lie Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser lie Asn Leu 
815 820 825 

ggt cag cag tec atg ccc aac aga ccc aac tac att ggc ttc aga gac 19223 
Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp 
830 835 840 

aac ttt ate ggt ctg atg tac tac aac age act ggc aat atg ggt gta 19271 
Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val 
845 " 850 855 

ctg get gga cag gec tec cag ctg aat get gtg gtg gac ttg cag gac 19319 
Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp 
860 865 870 875 

aga aac acc gaa ctg tec tac cag etc ttg ctt gac tct ctg ggt gac 19367 
Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp 
880 885 890 

aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac agt tat gac 19415 
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp 
895 900 905 

ccc gat gtg cgc att att gaa aat cac ggt gtg gag gat gaa ctt cct 19463 
Pro Asp Val Arg He He Glu Asn His Gly Val Glu Asp Glu Leu Pro 
910 915 920 
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aac tat tgc ttc ccc ctg gat get gtg ggt aga act gat act tac cag 19511 
Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin 
925 930 935 

gga att aag gec aat ggt gat aat caa acc acc tgg acc aaa gat gat 19559 
Gly lie Lys Ala Asn Gly Asp Asn Gin Thr Thr Trp Thr Lys Asp Asp 
940 " 945 . 950 955 

act gtt aat gat get aat gaa ttg ggc aag ggc aat cct ttc gec atg 19607 
Thr Val Asn Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe Ala Met 
960 965 970 

gag ate aac ate cag gec aac ctg tgg egg aac ttc etc tac gcg aac 19655 
Glu He Asn He Gin Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn 
975 980 985 



gtg gcg ctg tac ctg ccc gac tec tac aag tac acg ccg gee aac ate 
Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn He 
990 995 1000 



19703 



acg ctg ccc acc aac acc aac acc tac gat tac atg aac ggc cgc 
Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg 
1005 1010 1015 



19748 



gtg gtg gcg ccc teg ctg gtg gac gec tac ate aac ate ggg gcg 
Val Val Ala Pro Ser Leu Val Asp Ala Tyr He Asn He Gly Ala 
1020 1025 1030 



19793 



cgc tgg teg ctg gac ccc atg gac aac gtc aac ccc 
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro 
1035 1040 1045 



ttc aac cac 
Phe Asn His 



19838 



cac cgc aac gcg ggc ctg cga tac cgc tec atg etc ctg ggc aac 
His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn 
1050 1055 1060 



19883 



ggg cgc tac gtg ccc ttc cac ate cag gtg ccc caa aag ttt ttc 
Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin Lys Phe Phe 
1065 1070 1075 



19928 



gec ate aag age etc ctg etc ctg ccc ggg tec tac acc tac gag 
Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu 
1080 1085 1090 



19973 



tgg aac ttc cgc aag gac gtc aac atg ate ctg cag 
Trp Asn Phe Arg Lys Asp .Val Asn Met He Leu Gin 
1095 ~* " 1100 1105 



age tec etc 
Ser Ser Leu 



20018 



ggc aac gac ctg cgc acg gac ggg gec tec ate gee 
Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala 
1110 1115 1120 



ttc acc age 
Phe Thr Ser 



20063 



ate aac etc tac gec acc ttc 
He Asn Leu Tyr Ala Thr Phe 
1125 " 1130 



ttc ccc atg gcg cac 
Phe Pro Met Ala His 
1135 



aac acc gee 
Asn Thr Ala 



20108 
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tec acg etc gag gec atg ctg cgc aac gac acc aac gac cag tec 20153 
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 
1140 1145 1150 

ttc aac gac tac etc teg gcg gee aac atg etc tac ccc ate ccg 20198 
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro lie Pro 
1155 1160 1165 

gee aac gec acc aac gtg ccc ate tec ate ccc teg cgc aac tgg 20243 
Ala Asn Ala Thr Asn Val Pro lie Ser lie Pro Ser Arg Asn Trp 
1170 1175 1180 

gee gee ttc cgc ggc tgg tec ttc acg cgc etc aag acc cgc gag 20288 
Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu 
1185 1190 ' 1195 

acg ccc teg etc ggc tec ggg ttc gac ccc tac ttc gtc tac teg 20333 
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser 
1200 1205 1210 

ggc tec ate ccc tac etc gac ggc acc ttc tac etc aac cac acc 20378 
Gly Ser He Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr 
1215 1220 1225 

ttc aag aag gtc tec ate acc ttc gac tec tec gtc age tgg ccc 20423 
Phe Lys Lys Val Ser He Thr Phe Asp Ser Ser Val Ser Trp Pro 
1230 1235 1240 

ggc aac gac cgc etc ctg acg ccc aac gag ttc gaa ate aag cgc 20468 
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys Arg 
1245 ~ 1250 1255 

acc gtc gac gga gag ggg tac aac gtg gee cag tgc aac atg acc 20513 
Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gin Cys Asn Met Thr 
1260 * 1265 1270 

aag gac tgg ttc ctg gtc cag atg ctg gee cac tac aac ate ggc 20558 
Lys Asp Trp Phe Leu Val Gin Met Leu Ala His Tyr Asn He Gly 
1275 1280 1285 

tac cag ggc ttc tac gtg ccc gag ggc tac aag gac cgc atg tac 20603 
Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr 
1290 1295 1300 

tec ttc ttc cgc aac ttc cag ccc atg age cgc cag gtc gtg gac 20648 
Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin Val Val Asp 
1305 1310 1315 

gag gtc aac tac aag gac tac cag gec gtc acc ctg gec tac cag 20693 
Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala Tyr Gin 
1320 1325 1330 

cac aac aac teg ggc ttc gtc ggc tac etc gcg ccc acc atg cgc 20738 
His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg 
1335 1340 1345 
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cag ggc cag ccc tac ccc gcc aac tac ccc tac ccg etc ate ggc 
Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He Gly 
1350 1355 1360 

aag age gcc gtc gcc age gtc acc cag aaa aag ttc etc tgc gac 
Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Pbe Leu Cys Asp 
1365 1370 1375 

egg gtc atg tgg cgc ate ccc ttc tec age aac ttc atg tec atg 
Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met 
1380 * 1385 1390 

ggc gcg etc acc gac etc ggc cag aac atg etc tac gcc aac tec 
Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser 
1395 1400 1405 

gcc cac gcg eta gac atg aat ttc gaa gtc gac ccc atg gat gag 
Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu 
1410 1415 1420 

tec acc ctt etc tat gtt gtc ttc gaa gtc ttc gac gtc gtc cga 
Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg 
1425 1430 1435 

gtg cac cag ccc cac cgc ggc gtc ate gag gcc gtc tac ctg cgc 
Val His Gin Pro His Arg Gly Val He Glu Ala Val Tyr Leu Arg 
1440 1445 1450 

acg ccc ttc teg gcc ggc aac gcc acc acc taa gcctcttgct 
Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1455 1460 

tettgeaaga tgacggcctg cgcgggctcc ggegagcagg agctcagggc catcctccgc 

gacctgggct gcgggccctg cttcctgggc accttcgaca agcgcttccc gggattcatg 

gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac egggggegag 

cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt cgaccccttc 

gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg cctgctgcgt 

cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac ccagaccgtg 

cagggtccgc gctcggccgc ctgcgggctc ttctgetgea tgttcctgca cgccttcgtg 

cactggcccg accgccccat ggacaagaac cccaccatga acttgetgae gggggtgccc 

aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca ggaggegetc 

taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgcgcgcg catcgagaag 

gccaccgcct tcgaccgcat gaatcaagac atgtaatccg gtgtgtgtat gtgaatgett 

tattcatcat aataaacagc acatgtttat gccaccttct ctgaggctct gactttattt 

agaaatcgaa ggggttctgc cggctctcgg catggcccgc gggcagggat acgttgcgga 



20783 

20828 

20873 

20918 

20963 

21008 

21053 

21096 

21156 

21216 

21276 

21336 

21396 

21456 

21516 

21576 

21636 

21696 

21756 

21816 

21876 
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actggtactt 


gggcagccac 


ttgaactcgg 


ggatcagcag 


cttcggcacg 


gggaggtcgg 


21936 


ggaacgagtc 


gctccacagc 


ttgcgcgtga 


gttgcagggc 


gcccagcagg 


tcgggcgcgg 


21996 


agatcttgaa 


atcgcagttg 


ggacccgcgt 


tctgcgcgcg 


agagttacgg 


tacacggggt 


22056 


tgcagcactg 


gaacaccatc 


agggccgggt 


gcttcacgct 


cgccagcacc 


gtcgcgtcgg 


22116 


tgatgccctc 


cacgtccaga 


tcctcggcgt 


tggccatccc 


gaagggggtc 


atcttgcagg 


22176 


tctgccgccc 


catgctgggc 


acgcagccgg 


gcttgtggtt 


gcaatcgcag 


tgcaggggga 


22236 


tcagcatcat 


ctgggcctgc 


tcggagctca 


tgcccgggta 


catggccttc 


atgaaagcct 


22296 


ccagctggcg 


gaaggcctgc 


tgcgccttgc 


cgccctcggt 


gaagaagacc 


ccgcaggact 


22356 


tgctagagaa 


ctggttggtg 


gcgcagccag 


cgtcgtgcac 


gcagcagcgc 


gcgtcgttgt 


22416 


tggccagctg 


caccacgctg 


cgcccccagc 


ggttctgggt 


gatcttggcc 


cggtcggggt 


22476 


tctccttcag 


cgcgcgctgc 


ccgttctcgc 


tcgccacatc 


catctcgatc 


gtgtgctcct 


22536 


tctggatcat 


cacggtcccg 


tgcaggcacc 


gcagcttgcc 


ctcggcctcg 


gtgcacccgt 


22596 


gcagccacag 


cgcgcagccg 


gtgctctccc 


agttcttgtg 


ggcgatctgg 


gagtgcgagt 


22656 


gcacgaagcc 


ctgcaggaag 


cggcccatca 


tcgtggtcag 


ggtcttgttg 


ctggtgaagg 


22716 


tcagcggaat 


gccgcggtgc 


tcctcgttca 


catacaggtg 


gcagatacgg 


cggtacacct 


22776 


cgccctgctc 


gggcatcagc 


tggaaggcgg 


acttcaggtc 


gctctccacg 


cggtaccggt 


22836 


ccatcagcag 


cgtcatcact 


tccatgccct 


tctcccaggc 


cgaaacgatc 


ggcaggctca 


22896 


gggggttctt 


caccgttgtc 


atcttagtcg 


ccgccgccga 


agtcaggggg 


tcgttctcgt 


22956 


ccagggtctc aaacactcgc 


ttgccgtcct 


tctcggtgat 


gcgcacgggg 


ggaaagctga 


23016 


agcccacggc cgccagctcc 


tcctcggcct 


gcctttcgtc 


ctcgctgtcc 


tggctgatgt 


23076 


cttgcaaagg cacatgcttg 


gtcttgcggg 


gtttcttttt 


gggcggcaga 


ggcggcggcg 


23136 


gagacgtgct gggcgagcgc 


gagttctcgc 


tcaccacgac 


tatttcttct 


ccttggccgt 


23196 


cgtccgagac cacgcggcgg 


taggcatgcc 


tcttctgggg 


cagaggcgga 


ggcgacgggc 


23256 


tctcgcggtt cggcgggcgg 


ctggcagagc 


cccttccgcg 


ttcgggggtg 


cgctcctggc 


23316 


ggcgctgctc tgactgactt 


cctccgcggc 


cggccattgt 


gttctcctag 


ggagcaagca 


23376 


tggagactca 


gccatcgtcg 


ccaacatcgc 


catctgcccc 


cgccgccgcc 


gacgagaacc 


23436 


agcagcagca 


gaatgaaagc 


ttaaccgccc 


cgccgcccag 


ccccacctcc 


gacgccgcag 


23496 


ccccagacat gcaagagatg 


gaggaatcca 


tcgagattga 


cctgggctac 


gtgacgcccg 


23556 
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cggagcacga ggaggagctg gcagcgcgct tttcagcccc ggaagagaac caccaagagc 23616 

agccagagca ggaagcagag agcgagcaga accaggctgg gctcgagcat ggcgactacc 23676 

tgagcggggc agaggacgtg ctcatcaagc atctggcccg ccaatgcatc atcgtcaagg 23736 

acgcgctgct cgaccgcgcc gaggtgcccc tcagcgtggc ggagctcagc cgcgcctacg 23796 

agcgcaacct cttctcgccg cgcgtgcccc ccaagcgcca gcccaacggc acctgcgagc 23856 

ccaacccgcg cctcaacttc tacccggtct tcgcggtgcc cgaggccctg gccacctacc 23916 

acctcttttt caagaaccaa aggatccccg tctcctgccg cgccaaccgc acccgcgccg 23976 

acgccctgct caacctgggc cccggcgccc gcctacctga tatcgcctcc ttggaagagg 24036 

ttcccaagat cttcgagggt ctgggcagcg acgagactcg ggccgcgaac gctctgcaag 24096 

gaagcggaga ggagcatgag caccacagcg ccctggtgga gttggaaggc gacaacgcgc 24156 

gcctggcggt cctcaagcgc acggtcgagc tgacccactt cgcctacccg gcgctcaacc 24216 

tgccccccaa ggtcatgagc gccgtcatgg accaggtgct catcaagcgc gcctcgcccc 24276 

tctcggagga ggagatgcag gaccccgaga gctcggacga gggcaagccc gtggtcagcg 24336 

acgagcagct ggcgcgctgg ctgggagcga gtagcacccc ccagagcctg gaagagcggc 24396 

gcaagctcat gatggccgtg gtcctggtga ccgtggagct ggagtgtctg cgccgcttct 24456 

tcgccgacgc ggagaccctg cgcaaggtcg aggagaacct gcactacctc ttcagacacg 24516 

ggttcgtgcg ccaggcctgc aagatctcca acgtggagct gaccaacctg gtctcctaca 24576 

tgggcatcct gcacgagaac cgcctggggc agaacgtgct gcacaccacc ctgcgcgggg 24636 

aggcccgccg cgactacatc cgcgactgcg tctacctgta cctctgccac acctggcaga 24696 

cgggcatggg cgtgtggcag cagtgcctgg aggagcagaa cctgaaagag ctctgcaagc 24756 

tcctgcagaa gaacctcaag gccctgtgga ccgggttcga cgagcgcacc accgccgcgg 24816 

acctggccga cctcatcttc cccgagcgcc tgcggctgac gctgcgcaac gggctgcccg 24876 

actttatgag ccaaagcatg ttgcaaaact ttcgctcttt catcctcgaa cgctccggga 24936 

tcctgcccgc cacctgctcc gcgctgccct cggacttcgt gccgctgacc ttccgcgagt 24996 

gccccccgcc gctctggagc cactgctacc tgctgcgcct ggccaactac ctggcctacc 25056 

actcggacgt gatcgaggac gtcagcggcg agggcctgct cgagtgccac tgccgctgca 25116 

acctctgcac gccgcaccgc tccctggcct gcaaccccca gctgctgagc gagacccaga 25176 

tcatcggcac cttcgagttg caaggccccg gcgagggcaa ggggggtctg aaactcaccc 25236 

cggggctgtg gacctcggcc tacttgcgca agttcgtgcc cgaggactac catcccttcg 25296 
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agatcaggtt ctacgaggac caatcccagc cgcccaaggc cgagctgtcg gcctgcgtca 25356 

tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatcccgc caagaatttc 25416 

tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag ctcaacccca 25476 

gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct gccgccgccg 25536 

ccggaggatt tggaggaaga ctgggagagc agtcaggcag aggaggagga gatggaagac 25596 

tgggacagca ctcaggcaga ggaggacagc ctgcaagaca gtctggagga ggaagacgag 25656 

gtggaggagg cagaggaaga agcagccgcc gccagaccgt cgtcctcggc ggaggaggag 25716 

aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcgg ccgggcccac 25776 

agtagatggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25836 

cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25896 

tgcgggggca acatctcctt cacccggcgc tacctgctct tccaccgcgg ggtgaacttc 25956 

ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26016 

gaggcagaaa cccagcagca gcagcagcag cagaaaacca gcggcagcag ctagaaaatc 26076 

cacagcggcg gcaggtggac tgaggatcgc ggcgaacgag ccggcgcaga cccgggagct 26136 

gaggaaccgg atctttccca ccctctatgc catcttccag cagagtcggg ggcaagagca 26196 

ggaactgaaa gtcaagaacc gttctctgcg ctcgctcacc cgcagttgtc tgtatcacaa 26256 

gagcgaagac caacttcagc gcactctcga ggacgccgag gctctcttca acaagtactg 26316 

cgcgctcact cttaaagagt agcccgcgcc cgcccacaca cggaaaaagg cgggaattac 26376 

gtcaccacct gcgcccttcg cccgaccatc atcatgagca aagagattcc cacgccttac 26436 

atgtggagct accagcccca gatgggcctg gccgccggcg ccgcccagga ctactccacc 26496 

cgcatgaact ggctcagtgc cgggcccgcg atgatctcac gggtgaatga catccgcgcc 26556 

caccgaaacc agatactcct agaacagtca gcgatcaccg ccacgccccg ccatcacctt 26616 

aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc cacgaccgta 26676 

ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt ccagctggcc 26736 

ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta taaagcggct ggtgatccga 26796 

ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct gcgacctgac 26856 

ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca ggccgtcctg 26916 

actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct ccagttcgtg 26976 
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gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg ccactacccg 27036 

gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta cgattgaatg 27096 

tcccatggtg gcgcggctga cctagctcgg cttcgacacc tggaccactg ccgccgcttc 27156 

cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga ggagcaccct 27216 

cagggcccgg cccacggagt gccfgatcgtc gtcgaagggg gtctcgactc ccacctgctt 27276 

cggatcttca gccagcgtcc gatcctggcc gagcgcgagc aaggacagac ccttctgacc 27336 

ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct gctgtgtact 27396 

gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttc ctgctatcaa 27456 

ccagtccctg ttcttcaccg ggaacgagac cgagctccag ctccagtgta agccccacaa 27516 

gaagtacctc acctggctgt tccagggctc tccgatcgcc gttgtcaacc actgcgacaa 27576 

cgacggagtc ctgctgagcg gccctgccaa ccttactttt tccacccgca gaagcaagct 27636 

ccagctcttc caacccttcc tccccgggac ctatcagtgc gtctcgggac cctgccatca 27696 

caccttccac ctgatcccga ataccacagc gtcgctcccc gctactaaca accaaactac 27756 

ccaccaacgc caccgtcgcg acctttcctc tgggtctaat accactaccg gaggtgagct 27816 

ccgaggtcga ccaacctctg ggatttacta cggcccctgg gaggtggtag ggttaatagc 27876 

gctaggccta gttgcgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27936 

gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agtgagctgc 27996 

ggtgtgctgg tggcggtggt gctttcgatt gtgggactgg gcggcgcggc tgtagtgaag 28056 

gagaaggccg atccctgctt gcatttcaat cccgacaaat gccagctgag ttttcagccc 28116 

gatggcaatc ggtgcgcggt gctgatcaag tgcggatggg aatgcgagaa cgtgagaatc 28176 

gagtacaata acaagactcg gaacaatact ctcgcgtccg tgtggcagcc cggggacccc 28236 

gagtggtaca ccgtctctgt ccccggtgct gacggctccc cgcgcaccgt gaataatact 28296 

ttcatttttg cgcacatgtg cgacacggtc atgtggatga gcaagcagta cgatatgtgg 28356 

ccccccacga aggagaacat cgtggtcttc tccatcgctt acagcgtgtg cacggcgcta 28416 

atcaccgcta tcgtgtgcct gagcattcac atgctcatcg ctattcgccc cagaaataat 28476 

gccgaaaaag aaaaacagcc ataacacgtt ttttcacaca cctttttcag accatggcct 28536 

ctgttaaatt tttgctttta tttgccagtc tcattgccgt cattcatgga atgagtaatg 28596 

agaaaattac tatttacact ggcactaatc acacattgaa aggtccagaa aaagccacag 28656 

aagtttcatg gtattgttat tttaatgaat cagatgtatc tactgaactc tgtggaaaca 28716 
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ataacaaaaa aaatgagagc attactctca tcaagtttca atgtggatct gacttaaccc 28776 

taattaacat cactagagac tatgtaggta tgtattatgg aactacagca ggcatttcgg 28836 

acatggaatt ttatcaagtt tctgtgtctg aacccaccac gcctagaatg accacaacca 28896 

caaaaactac acctgttacc actatacagc tcactaccaa tggctttctt gccatgcttc 28956 

aagtggctga aaatagcacc agcattcaac ccaccccacc cagtgaggaa attcccagat 29016 

ccatgattgg cattattgtt gctgtagtgg tgtgcatgtt gatcatcgcc ttgtgcatgg 29076 

tgtactatgc cttctgctac agaaagcaca gactgaacga caagctggaa cacttactaa 29136 

gtgttgaatt ttaatttttt agaaccatga agatcctagg ccttttagtt ttttctatca 29196 

ttacctctgc tctatgcaat tctgacaatg aggacgttac tgtcgttgtc ggatcaaatt 29256 

atacactaaa aggtccagca aaa.ggtatgc tttcgtggta ttgttggttc ggaactgacg 29316 

agcaacagac agaactttgc aatgctcaaa aaggcaaaac ctcaaattct aaaatctcta 29376 

attatcaatg caatggcact gacttagtat tgctcaatgt cacgaaagca tatgctggca 29436 

gttacacctg ccctggagat gatgccgaca atatgatttt ttacaaagtg gaagtggttg 29496 

atcccactac tccaccgccc accaccacaa ctactcatac cacacacaca gaacaaacac 29556 

cagaggcagc agaagcagag ttggccttcc aggttcacgg agattccttt gctgtcaata 29616 

cccctacacc cgatcagcgg tgtccggggc tgctcgtcag cggcattgtc ggtgtgcttt 29676 

cgggattagc agtcataatc atctgcatgt tcatttttgc ttgctgctat agaaggcttt 29736 

accgacaaaa atcagaccca ctgctgaacc tctatgttta attttttcca gagccatgaa 29796 

ggcagttagc gctctagttt tttgttcttt gattggcatt gtttttagtg ctgggttttt 29856 

gaaaaatctt accatttatg aaggtgagaa tgccactcta gtgggcatca gtggtcaaaa 29916 

tgtcagctgg ctaaaatacc atctagatgg gtggaaagac atttgcgatt ggaatgtcac 29976 

tgtgtataca tgtaatggag ttaacctcac cattactaat gccacccaag atcagaatgg 30036 

taggtttaag ggccagagtt tcactagaaa taatgggtat gaatcccata acatgtttat 30096 

ctatgacgtc actgtcatca gaaatgagac tgccaccacc acacagatgc ccactacaca 30156 

cagttctacc actactacca tgcaaaccac acagacaacc actacatcaa ctcagcatat 30216 

gaccaccact acagcagcaa agccaagtag tgcagcgcct cagccccagg ctttggcttt 30276 

gaaagctgca caacctagta caactactag gaccaatgag cagactactg aatttttgtc 30336 

cactgtcgag agccacacca cagctacctc cagtgccttc tctagcaccg ccaatctctc 30396 
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ctcgctttcc tctacaccaa tcagtcccgc tactactccc accccagctc ttctccccac 30456 

tcccctgaag caaactgagg acagcggcat gcaatggcag atcaccctgc tcattgtgat 30516 

cgggttggtc atcctggccg tgttgctcta ctacatcttc tgccgccgca ttcccaacgc 30576 

gcaccgcaaa ccggcctaca agcccatcgt tatcgggcag ccggagccgc ttcaggtgga 30636 

agggggtcta aggaatcttc tcttctcttt tacagtatgg tgattgaact atgattccta 30696 

gacaattctt gatcactatt cttatctgcc tcctccaagt ctgtgccacc ctcgctctgg 30756 

tggccaacgc cagtccagac tgtattgggc ccttcgcctc ctacgtgctc tttgccttca 30816 

tcacctgcat ctgctgctgt agcatagtct gcctgcttat caccttcttc cagttcattg 30876 

actggatctt tgtgcgcatc gcctacctgc gccaccaccc ccagtaccgc gaccagcgag 30936 

tggcgcggct gctcaggctc ctctgataag catgcgggct ctgctacttc tcgcgcttct 30996 

gctgttagtg ctcccccgcc ccgtcgaccc ccggtccccc actcagtccc ccgaagaggt 31056 

ccgcaaatgc aaattccaag aaccctggaa attcctcaaa tgctaccgcc aaaaatcaga 31116 

catgcttccc agctggatca tgatcattgg gatcgtgaac attctggcct gcaccctcat 31176 

ctcctttgtg atttacccct gctttgactt tggttggaac tcgccagagg cgctctatct 31236 

cccgcctgaa cctgacacac caccacagca acctcaggca cacgcactac caccaccaca 31296 

gcctaggcca caatacatgc ccatattaga ctatgaggcc gagccacagc gacccatgct 31356 

ccccgctatt agttacttca atctaaccgg cggagatgac tgacccactg gccaaaaaca 31416 

acgtcaacga ccttctcctg gacatggacg gccgcgcctc ggagcagcga ctcgcccaac 31476 

ttcgcattcg ccagcagcag gagagagccg tcaaggagct gcaggacggc atagccatcc 31536 

accagtgcaa gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc tacgaggtca 31596 

ccccgaccga ccatcgcctc tcctacgagc tcctgcagca gcgccagaag ttcacctgcc 31656 

tggtcggagt caaccccatc gtcatcaccc agcagtcggg cgataccaag gggtgcatcc 31716 

actgctcctg cgactccccc gactgcgtcc acactctgat caagaccctc tgcggcctcc 31776 

gcgacctcct ccccatgaac taatcacccc cttatccagt gaaataaata tcatattgat 31836 

gatgatttaa ataaaaaata atcatttgat ttgaaataaa gatacaatca tattgatgat 31896 

ttgagtttta aaaaataaag aatcacttac ttgaaatctg ataccaggtc tctgtccatg 31956 

ttttctgcca acaccacctc actcccctct tcccagctct ggtactgcag accccggcgg 32016 

gctgcaaact tcctccacac gctgaagggg atgtcaaatt cctcctgtcc ctcaatcttc 32076 
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attttatctt ctatcag atg tec 

Met Ser 
1465 



aaa aag cgc gtc egg gtg gat gat gac 
Lys Lys Arg Val Arg Val Asp Asp Asp 
1470 



32126 



ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 
Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 
1475 1480 1485 



32171 



ccc ttc ate aac ccc ccc ttc gtc tct tea gat gga ttc caa gag 
Pro Phe lie Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gin Glu 
1490 1495 - 1500 



32216 



aag ccc ctg ggg gtg ctg tec ctg cga ctg get gac ccc gtc acc 
Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 
1505 1510 1515 



32261 



acc aag aac ggg gaa ate acc etc aag ctg gga gag ggg gtg gac 
Thr Lys Asn Gly Glu lie Thr Leu Lys Leu Gly Glu Gly Val Asp 
1520 1525 1530 



32306 



etc gac tec teg gga aaa etc ate tec aac acg gee acc aag gec 
Leu Asp Ser Ser Gly Lys Leu lie Ser Asn Thr Ala Thr Lys Ala 
1535 1540 1545 



32351 



gec gee cct etc agt ttt tec aac aac acc att tec ctt aac atg 
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr lie Ser Leu Asn Met 
1550 1555 1560 



32396 



gat acc cct ctt tat acc aaa gat gga aaa tta 
Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu 
1565 1570' " 1575 



tec tta caa gtt 
Ser Leu Gin Val 



32441 



tct cca ccg tta aac ata tta aaa tea acc att ctg aac aca tta 
Ser Pro Pro Leu Asn He Leu Lys Ser Thr He Leu Asn Thr Leu 
1580 1585 1590 



32486 



get gta get tat gga tea ggt tta gga ctg agt ggt ggc act get 
Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala 
1595 1600 1605 



32531 



ctt gca gta cag ttg gec 
Leu Ala Val Gin Leu Ala 
1610 1615 



tct cca etc act ttt gat gaa aaa gga 
Ser Pro Leu Thr Phe Asp Glu Lys Gly 
1620 



32576 



aat att aaa att aac eta gee agt ggt cca tta aca gtt gat gca 
Asn He Lys He Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 
1625 1630 1635 



32621 



agt cga ctt agt ate aac tgc aaa aga ggg gtc act gtc act acc 
Ser Arg Leu Ser He Asn Cys Lys Arg Gly Val Thr Val Thr Thr 
1640 1645 1650 



32666 



tea gga gat gca att gaa age aac ata age tgg cct aaa ggt ata 
Ser Gly Asp Ala He Glu Ser Asn He Ser Trp Pro Lys Gly He 
1655 1660 1665 



32711 
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aga ttt gaa ggt aat ggc ata get gca aac att ggc aga gga ttg 
Arg Phe Glu Gly Asn Gly He Ala Ala Asn He Gly Arg Gly Leu 
1670 * 1675 1680 



32756 



gaa ttt gga acc act agt aca gag act gat gtc aca gat gca tac 
Glu Phe Gly Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr 
1685 1690 1695 



32801 



cca att caa gtt aaa ttg ggt act ggc ctt acc ttt gac agt aca 
Pro He Gin Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr 
1700 ~ 1705 1710 



32846 



ggc gec att gtt get tgg aac aaa gag gat gat aaa ctt aca tta 
Gly Ala He Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu 
1715 1720 1725 



32891 



tgg acc aca gec gac ccc teg cca aat tgc aaa ata tac tct gaa 
Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys He Tyr Ser Glu 
1730 1735 1740 



32936 



aaa gat gec aaa etc aca ctt tgc ttg aca aag tgt gga agt caa 
Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
1745 1750 1755 



32981 



att ctg ggt act gtg act gta ttg gca gtg aat aat gga agt etc 
He Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu 
1760 1765 1770 



33026 



aac cca ate aca aac aca gta age act gca etc gtc tec etc aag 
Asn Pro He Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys 
1775 1780 1785 



33071 



ttt gat gca agt gga gtt ttg eta age age tec aca tta gac aaa 
Phe Asp Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys 
1790 1795 1800 



33116 



gaa tat tgg aac ttc aga aag gga gat gtt aca cct get gag ccc 
Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro 
1805 1810 1815 



33161 



tat act aat get ata ggt ttt atg cct aac ata aag gec tat cct 
Tyr Thr Asn Ala He Gly - Phe Met Pro Asn He Lys Ala Tyr Pro 
1820 1825 1830 



33206 



aaa aac aca tct gca get tea aaa age cat att gtc agt caa gtt 
Lys Asn Thr Ser Ala Ala Ser Lys Ser His He Val Ser Gin Val 
1835 1840 1845 



33251 



tat etc aat ggg gat gag gee aaa cca ctg atg ctg att att act 
Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu He He Thr 
1850 1855 1860 



33296 



ttt aat gaa act gag gat gca act tgc acc tac agt ate act ttt 
Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser He Thr Phe 
1865 1870 1875 



33341 
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caa tgg aaa tgg gat agt act aag tac aca ggt gaa aca ctt get 33386 

Trp LyS Trp Ser Thr T * r Thr G1 V Thr Leu Ala 

1880 1885 X890 

acc age tec ttc ace ttc • tec tac ate gec caa gaa tga acactgtatc 33435 
Thr Ser Ser Phe Thr Phe Ser Tyr lie Ala Gin Glu 
1895 1900 190 5 

ccaccctgca tgccaaccct tcccacccca ctctgtctat ggaaaaaact ctgaagcaca 33495 
aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33555 
tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagecttga 33615 
acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33675 
agegagecag tetegggteg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33735 
cctcacagct caacagctga ggattgtcct cggtggtcgg gatcaeggtt atctggaaga 33795 
agcagaagag cggcggtggg aatcatagtc cgcgaacggg ateggceggt ggtgtcgcat 33855 
caggccccgc ageagtcget gccgccgccg ctccgtcaag ctgctgctca gggggtcegg 33915 
gtccagggac tccctcagca tgatgcccac ggccctcagc ateagtegtc tggtgcggcg 33975 
ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acaggaccac 34035 
caggttgttc aacagtccat agttcaacac gctccagccg aaactcatcg egggaaggat 34095 
gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggcgct ccctccagaa 34155 
cacgctgccc aegtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34215 
catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34275 
cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34335 
ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34395 
tatgetcatg catctcttca gcactctcag ctcctcgggg gtcaaaacca tatcccaggg 34455 
caeggggaac tettgeagga cagcgaaccc cgcagaacag ggcaatcctc gcacataact 34515 
tacattgtgc atggacaggg tategcaate aggcagcacc gggtgatcct ccaccagaga 34575 
agegegggtc tcggtctcct cacagcgtgg taagggggee ggccgatacg ggtgatggcg 34635 
ggacgegget gatcgtgttc gcgaccgtgt catgatgeag ttgetttegg acattttcgt 34695 
acttgctgta gcagaacctg gtccgggcgc tgcacaccga tcgccggcgg cggtcccggc 34755 
gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34815 
ctagggcctc aggagtgatg aagatcccat catgectgat agctctgatc acatcgacca 34875 
ccgtggaatg ggccagaccc agecagatga tgcaattttg ttgggtttcg gtgaeggegg 34935 
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gggagggaag 


aacaggaaga 


accatgatta 


acttttaatc 


caaacggtct 


cggagcactt 


34995 


caaaatgaag 


gtcgcggaga 


tggcacctct 


cgcccccgct 


gtgttggtgg 


aaaataacag 


35055 


ccaggtcaaa 


ggtgatacgg 


ttctcgagat 


gttccacggt 


ggcttccagc 


aaagcctcca 


35115 


cgcgcacatc 


cagaaacaag 


acaatagcga 


aagcgggagg 


gttctctaat 


tcctcaatca 


35175 


tcatgttaca 


ctcctgcacc 


atccccagat 


aattttcatt 


tttccagcct 


tgaatgattc 


35235 


gaactagttc 


ctgaggtaaa 


tccaagccag 


ccatgataaa 


gagctcgcgc 


agagcgccct 


35295 


ccaccggcat 


tcttaagcac 


accctcataa 


ttccaagata 


ttctgctcct 


ggttcacctg 


35355 


cagcagattg 


acaagcggaa 


tatcaaaatc 


tctgccgcga 


tccctaagct 


cctccctcag 


35415 


caataactgt 


aagtactctt 


tcatatcctc 


tccgaaattt 


ttagccatag 


gaccaccagg 


35475 


aataagatta 


gggcaagcca 


cagtacagat 


aaaccgaagt 


cctccccagt 


gagcattgcc 


35535 


aaatgcaaga 


ctgctataag 


catgctggct 


agacccggtg 


atatcttcca 


gataactgga 


35595 


cagaaaatca 


cccaggcaat 


ttttaagaaa 


atcaacaaaa 


gaaaaatcct 


ccaggtgcac 


35655 


gtttagagcc 


tcgggaacaa 


cgatgaagta 


aatgcaagcg 


gtgcgttcca 


gcatggttag 


35715 


ttagctgatc 


tgtaaaaaac 


aaaaaataaa 


acattaaacc 


atgctagcct 


ggcgaacagg 


35775 


tgggtaaatc 


gttctctcca 


gcaccaggca 


ggccacgggg 


tctccggcgc 


gaccctcgta 


35835 


aaaattgtcg 


ctatgattga 


aaaccatcac 


agagagacgt 


tcccggtggc 


cggcgtgaat 


35895 


gattcgacaa 


gatgaataca 


cccccggaac 


attggcgtcc 


gcgagtgaaa 


aaaagcgccc 


35955 


gaggaagcaa 


taaggcacta 


caatgctcag 


tctcaagtcc 


agcaaagcga 


tgccatgcgg 


36015 


atgaagcaca 


aaatcctcag 


gtgcgtacaa 


aatgtaatta 


ctcccctcct 


gcacaggcag 


36075 


cgaagccccc 


gatccctcca 


gatacacata 


caaagcctca 


gcgtccatag 


cttaccgagc 


36135 


agcagcacac 


aacaggcgca 


agagtcagag 


aaaggctgag 


ctctaacctg 


tccacccgct 


36195 


ctctgctcaa 


tatatagccc 


agatctacac 


tgacgtaaag 


gccaaagtct 


aaaaataccc 


36255 


gccaaataat 


cacacacgcc 


cagcacacgc 


ccagaaaccg 


gtgacacact 


caaaaaaata 


36315 


cgcgcacttc 


ctcaaacgcc 


caaactgccg 


tcatttccgg 


gttcccacgc 


tacgtcatcg 


36375 


gaattcgact 


ttcaaattcc 


gtcgaccgtt 


aaaaacgtca 


cccgccccgc 


ccctaacggt 


36435 


cgcccgtctc 


tcggccaatc 


accttcctcc 


ctccccaaat 


tcaaacagct 


catttgcata 


36495 


ttaacgcgca 


ccaaaagttt 


gaggtatatt 


attgatgatg 






36535 
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<210> 10 

<211> 531 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 

<400> 10 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
1 5 10 15 

Val Met Gin Gin Ala Val Ala Ala Ala Met Gin Pro Pro Leu Glu Ala 
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr 
50 55 60 

Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu Asn Tyr Gin Asn 
65 70 75 eo 

Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr 
85 90 95 

Pro Thr Glu Ala Ser Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg 
100 105 no 

Trp Gly Gly Gin Leu Lys Thr He Met His Thr Asn Met Pro Asn Val 
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 135 140 

Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn Tyr Asp Gly Ser 
145 150 155 160 

Gin Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly 
165 170 175 

Asn Phe Ser Val Thr Met Thr He Asp Leu Met Asn Asn Ala He He 
180 185 190 

Asp Asn Tyr Leu Ala Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp 
195 200 205 

He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro 
210 215 220 

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His 
225 230 235 240 

Pro Asp He Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser 
245 250 255 

Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu 
260 265 270 
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Gly Phe Gin lie Leu Tyr Glu Asp Leu Glu Gly Gly Asn lie Pro Ala 
275 ~ • 280 285 

Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala 
290 & 295 300 

Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp 
305 310 315 320 

Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser 
325 330 335 

Lys lie Val lie Gin Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr 
340 345 350 

Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu 
355 360 365 

Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu 
370 * 375 380 

Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gin Val Tyr Trp Ser* 
385 390 395 400 

Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin 
405 410 415 

Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser 
420 425 430 

Lys Ser Phe Phe Asn Glu Gin Ala Val Tyr Ser Gin Gin Leu Arg Ala 
435 440 445 

Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin lie 
450 455 460 

Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val 
465 470 475 480 

Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser He Arg 
485 490 495 

Gly Val Gin Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro 
500 505 510 

Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser 
515 520 525 

Arg Thr Phe 
530 

<210> 11 

<2H> 932 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 
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<400> 11 

Met Ala Thr Pro Ser Met Leu Pro Gin Trp Ala Tyr Met His He Ala 
15 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 * 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly Asp Thr Asp Thr 
130 135 140 

Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin Gly He Ser He 
145 150 155 160 

Thr Lys Asp Gly He Gin Leu Gly Thr Asp Ser Asp Gly Gin Ala He 
165 170 175 

Tyr Ala Asp Glu Thr Tyr Gin Pro Glu Pro Gin Val Gly Asp Ala Glu 
180 185 190 

Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 
195 200 205 

Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
210 215 220 

Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr Glu Thr Gly Gly 
225 ~ " 230 235 240 

Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 
245 250 255 

Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr Thr Glu Asn Val 



260 



265 



270 



Asp Leu 



Glu 

275 



Thr 



Pro 



Asp Thr 



His 
280 



He Val Tyr Lys 



Ala 
285 



Gly Thr Asp 
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Asp Ser Ser Ser Ser lie Asn Leu Gly Gin Gin Ser Met Pro Asn Arg 
290 295 300 

Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr 
305 ~ 310 315 320 

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
325 330 335 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
340 345 350 

Leu Leu Leu Asp Ser Leu (Sly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
355 360 365 

Asn Gin Ma Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn 
370 375 380 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
385 " " 390 395 400 

Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala Asn Gly Asp Asn 
405 410 415 

Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 
420 425 430 

Gly Lys Gly Asn Pro Phe Ala Met Glu He Asn He Gin Ala Asn Leu 
435 440 445 

Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
450 455 460 

Tyr Lys Tyr Thr Pro Ala Asn He Thr Leu Pro Thr Asn Thr Asn Thr 
465 470 475 480 

Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala 
485 490 495 

Tyr He Asn He Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val 
500 505 510 . 

Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met 
515 520 525 

Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin 
530 535 540 

Lys Phe Phe Ala He Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr 
545 550 555 560 

Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser Ser 
565 570 575 

Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser He Ala Phe Thr Ser 
580 585 590 
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lie Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser 
595 600 605 

Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn 
610 615 620 

Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro lie Pro Ala Asn Ala 
625 630 635 640 

Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala Phe Arg 
645 650 655 

Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly 
660 665 670 

Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser He Pro Tyr Leu 
675 680 685 

Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser lie Thr 
690 695 " 700 

Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro 
705 710 715 720 

Asn Glu Phe Glu He Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val 
725 730 735 

Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gin Met Leu Ala 
740 " 745 750 

His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly Tyr Lys 
755 760 765 

Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin 
770 775 780 

Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gin Ala Val Thr Leu Ala 
785 790 795 800 

Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met 
805 810 815 

Arg Gin Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He Gly 
820 825 830 

Lys Ser Ala Val Ala Ser Val Thr Gin Lys Lys Phe Leu Cys Asp Arg 
835 840 845 

Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala 
850 855 860 

Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala 
865 870 875 880 



Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu 
885 890 895 
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Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro His 
900 905 910 

Arg Gly Val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly 
915 920 925 

Asn Ala Thr Thr 
930 



<210> 12 
<211> 443 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 
<400> 12 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
15 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe lie Asn Pro Pro 
20 * 25 30 

Phe Val Ser Ser Asp Gly Phe Gin Glu Lys Pro Leu Gly Val Leu Ser 
35 40 45 

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu He Thr Leu 
50 55 60 

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu He Ser 
65 70 75 80 

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

He Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu 
100 105 HO 

Ser Leu Gin Val Ser Pro Pro Leu Asn He Leu Lys Ser Thr He Leu 
115 120 125 

Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly 
130 135 140 

Thr Ala Leu Ala Val Gin Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys 
145 150 155 160 

Gly Asn He Lys He Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala 
165 170 175 

Ser Arg Leu Ser He Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ser 
180 185 190 

Gly Asp Ala He Glu Ser Asn He Ser Trp Pro Lys Gly He Arg Phe 
195 200 205 
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Glu Gly Asn Gly lie Ala Ala Asn lie Gly Arg Gly Leu Glu Phe Gly 
210 215 220 

Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro lie Gin Val 
225 230 235 240 

Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala lie Val Ala 
245 * 250 " 255 

Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro 
260 265 270 

Ser Pro Asn Cys Lys lie Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu 
275 280 285 

Cys Leu Thr Lys Cys Gly Ser Gin He Leu Gly Thr Val Thr Val Leu 
290 295 300 

Ala Val Asn Asn Gly Ser Leu Asn Pro He Thr Asn Thr Val Ser Thr 
305 ^ 310 315 320 

Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser 
325 330 335 

Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr 
340 345 350 

Pro Ala Glu Pro Tyr Thr Asn Ala He Gly Phe Met Pro Asn He Lys 
355 360 ~ 365 

Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His He Val Ser 
370 " 375 380 

Gin Val Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu He He 
385 390 395 400 

Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser He Thr Phe 
405 410 415 

Gin Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala Thr 
420 425 430 

Ser Ser Phe Thr Phe Ser Tyr He Ala Gin Glu 
435 440 



<210> 13 
<211> 338 
<212> PRT 

<213> simian serotype CI 
<400> 13 

Ala Pro Lys Gly Ala Pro Asn Thr Ser Gin Trp Leu Asp Lys Gly Val 
15 10 15 
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Thr Thr Thr Asp Asn Asn Thr Glu Asn Gly Asp Glu Glu Asp Glu Val 
20 25 30 

Ala Glu Glu Gly Glu Glu Glu Lys Gin Ala Thr Tyr Thr Phe Gly Asn 
35 40 45 

Ala Pro Val Lys Ala Glu Ala Glu He Thr Lys Glu Gly Leu Pro He 
50 55 60 

Gly Leu Glu Val Pro Ser Glu Gly Asp Pro Lys Pro He Tyr Ala Asp 
65 70 75 80 

Lys Leu Tyr Gin Pro Glu Pro Gin Val Gly Glu Glu Ser Trp Thr Asp 
85 90 95 

Thr Asp Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Glu 
100 105 HO 

Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Val 
115 120 125 

Lys Gly Gly Gin Ala Lys Val Lys Lys Val Glu Glu Gly Lys Val Glu 
130 135 140 

Tyr Asp He Asp Met Asn Phe Phe Asp Leu Arg Ser Gin Lys Thr Gly 
145 150 155 160 

Leu Lys Pro Lys He Val Met Tyr Ala Glu Asn Val Asp Leu Glu Thr 
165 170 175 

Pro Asp Thr His Val Val Tyr Lys Pro Gly Ala Ser Asp Ala Ser Ser 
180 185 190 

His Ala Asn Leu Gly Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr He 
195 200 205 

Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly 
210 215 220 

Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val 
225 " 230 235 240 

Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp 
245 250 255 

Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val 
260 265 270 

Asp Ser Tyr Asp Pro Asp Val Arg Val He Glu Asn His Gly Val Glu 
275 280 285 

Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Val Gly Pro Arg 
290 295 300 

Thr Asp Ser Tyr Lys Gly He Glu Thr Asn Gly Asp Glu Asn Thr Thr 
305 310 315 320 
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Trp Lys Asp Leu Asp Pro Asn Gly He Ser Glu Leu Ala Lys Gly Asn 
325 330 335 

Pro Phe 



<210> 14 

<211> 315 

<212> PRT 

<213> chimpanzee adenovirus Pan- 9 

<400> 14 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 
1 5 10 15 

Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 
20 25 30 

Gin Gly He Asn He Thr Lys Asp Gly He Gin Leu Gly Thr Asp Thr 
35 40 45 

Asp Asp Gin Pro He Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 
50 55 60 

Val Gly Asp Ala Glu Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr 
65 70 75 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 " * 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 
100 105 " 110 

Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe 
115 120 125 

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu 
130 135 140 

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr 
145 150 155 160 

Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin 
165 170 175 

Ala Met Pro Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He 
180 185 190 

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 
195 200 205 

Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 
210 215 220 
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Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 
225 230 235 240 

Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val 
245 250 255 

Arg lie lie Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys 
260 265 270 

Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys 
275 280 285 

Ala Asn Gly Thr Asp Gin Thr Thr Trp Thr Lys Asp Asp Ser Val Asn 
290 ~ 295 300 

Asp Ala Asn Glu He Gly Lys Gly Asn Pro Phe 
305 310 315 



<210> 15 
<211> 315 
<212> PRT 

<213> chimpanzee adenovirus Pan-5 
<400> 15 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Asp 
15 10 15 

Gly Asp Thr Gly Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 
20 25 30 

Gin Gly He Ser He Thr Lys Asp Gly He Gin Leu Gly Thr Asp Thr 
35 40 45 

Asp Asp Gin Pro He Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin 
50 55 60 

Val Gly Asp Ala Glu Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr 
65 70 75 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys 
100 105 110 

Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe 
115 120 125 

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu 
130 135 140 

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr 
145 150 155 160 
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Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser lie Asn Leu Gly Gin Gin 
165 170 175 

Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe Arg Asp Asn Phe He 
180 185 190 

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly 
195 200 205 

Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr 
210 215 220 

Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg 
225 230 235 240 

Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val 
245 250 255 

Arg He He Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys 
260 265 270 

Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys 
275 280 285 

Ala Asn Gly Ala Asp Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn 
290 295 300 

Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe 
305 310 315 



<210> 16 

<211> 324 

<212> PRT 

<213> chimpanzee adenovirus Pan-6 

<400> 16 

Ala Pro Lys Gly Ala Pro Asn Ser Ser Gin Trp Glu Gin Ala Lys Thr 
1 5 10 15 

Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met 
20 25 30 

Gly Gly Glu Asn He Thr Lys Asp Gly Leu Gin He Gly Thr Asp Val 
35 40 45 

Thr Ala Asn Gin Asn Lys Pro He Tyr Ala Asp Lys Thr Phe Gin Pro 
50 55 60 

Glu Pro Gin Val Gly Glu Glu Asn Trp Gin Glu Thr Glu Asn Phe Tyr 
65 " 70 75 80 

Gly Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 
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Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Lys Leu Lys 
100 105 110 

Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp lie Asp Leu Ala 
115 120 125 

Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gin Asp Glu Tyr Lys 
130 135 140 

Ala Asp lie Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp 
145 150 155 160 

Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu lie 
165 170 175 

Asn Leu Val Gin Gin Ser Met Pro Asn Arg Pro Asn Tyr lie Gly Phe 
180 185 190 

Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 
195 200 205 

Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 
210 215 220 

Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu 
225 230 235 240 

Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 
245 250 255 

Tyr Asp Pro Asp Val Arg He He Glu Asn His Gly Val Glu Asp Glu 
260 265 270 

Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala 
275 280 285 

Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser 
290 295 300 

Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys 
305 310 315 320 



Gly Asn He Phe 



<210> 17 
<211> 314 
<212> PRT 

<213> chimpanzee adenovirus Pan- 7 
<400> 17 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gin Trp Thr Tyr Lys Ala Gly 
15 10 15 

Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gin 
20 25, 30 
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Gly lie Ser He Thr Lys Asp Gly He Gin Leu Gly Thr Asp Ser Asp 
35 40 45 

Gly Gin Ala He Tyr Ala Asp Glu Thr Tyr Gin Pro Glu Pro Gin Val 
50 55 60 

Gly Asp Ala Glu Trp His Asp He Thr Gly Thr Asp Glu Lys Tyr Gly 
65 70 75 80 

Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser 
85 90 95 

Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gin Ala Asn Val Lys Thr 
100 105 110 

Glu Thr Gly Gly Thr Lys Glu Tyr Asp He Asp Met Ala Phe Phe Asp 
115 ~ 120 125 

Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu He Val Leu Tyr 
130 135 140 

Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His He Val Tyr Lys 
145 150 155 160 

Ala Gly Thr Asp Asp Ser Ser Ser Ser He Asn Leu Gly Gin Gin Ser 
165 170 175 

Met Pro Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly 
180 185 190 

Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin 
195 200 205 

Ala Ser Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu 
210 215 220 

Leu Ser Tyr Gin Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr 
225 230 235 240 

Phe Ser Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg 
245 250 255 

He He Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe 
260 265 270 

Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gin Gly He Lys Ala 
275 280 285 

Asn Gly Asp Asn Gin Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp 
290 295 " 300 



Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe 
305 310 
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<210> 18 
<211> 179 
<212> PRT 

<213> chimpanzee adenovirus Pan9 
<400> 18 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin lie Leu Ala' 
15 10 15 

Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 " 25 30 

He Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn 
35 40 45 

Pro He Thr Gly Thr Val Ser Ser Ala Gin Val Phe Leu Arg Phe Asp 
50 55 60 

Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp 
65 70 75 80 

Gly Tyr Arg Gin Gly Asp Ser He Asp Gly Thr Pro Tyr Thr Asn Ala 
85 90 95 

Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gin Ser Ser 
100 105 HO 

Thr Thr Lys Asn Asn He Val Gly Gin Val Tyr Met Asn Gly Asp Val 
115 120 125 

Ser Lys Pro Met Leu Leu Thr He Thr Leu Asn Gly Thr Asp Asp Ser 
130 135 140 

Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser 
145 150 155 160 

Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr He 
165 170 175 



Ala Gin Glu 



<210> 19 

<211> 185 

<212> PRT 

<213> chimpanzee adenovirus Pan6 

<400> 19 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gin Leu Leu Ser 
1 5 10 15 

Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 30 
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He Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu 
35 40 45 

Asn Pro He Asn Asp Thr Val Lys Ser Ala He Val Phe Leu Arg Phe 
50 55 60 

Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr 
65 70 75 ■ 80 

Trp Asn Phe Arg Glu Gly Gin Thr Thr Gin Ser Val Ala Tyr Thr Asn 
85 90 95 

Ala Val Gly Phe Met Pro Asn He Gly Ala Tyr Pro Lys Thr Gin Ser 
100 105 110 

Lys Thr Pro Lys Asn Ser He Val Ser Gin Val Tyr Leu Thr Gly Glu 
115 120 125 

Thr Thr Met Pro Met Thr Leu Thr He Thr Phe Asn Gly Thr Asp Glu 
130 135 140 

Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr Trp Gin 
145 150 155 160 

Trp Thr Gly Asp Tyr Lys Asp Lys Asn He Thr Phe Ala Thr Asn Ser 
165 170 175 

Phe Ser Phe Ser Tyr He Ala Gin Glu 
180 185 



<210> 20 
<211> 179 
<212> PRT 

<213> chimpanzee adenovirus Pan7 
<400> 20 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys He Tyr Ser 
15 10 15 

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 " 30 

He Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn 
35 40 45 

Pro He Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp 
50 55 60 

Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp 
65 70 75 ** 80 

Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn Ala 
85 90 95 
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lie Gly Phe Met Pro Asn He Lys Ala Tyr Pro Lys Asn Thr Ser Ala 
100 105 no 

Ala Ser Lys Ser His He Val Ser Gin Val Tyr Leu Asn Gly Asp Glu 
115 120 125 

Ala Lys Pro Leu Met Leu He He Thr Phe Asn Glu Thr Glu Asp Ala 
130 135 140 

Thr Cys Thr Tyr Ser He Thr Phe Gin Trp Lys Trp Asp Ser Thr Lys 
I 45 150 155 160 

Tyr Thr Gly Glu Thr Leu Ala Thr Ser Ser Phe Thr Phe Ser Tyr He 
165 170 175 

Ala Gin Glu 



<210> 21 

<211> 179 

<212> PRT 

<213> chimpanzee adenovirus Pan5 

<400> 21 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His He Tyr Ser 
1 5 10 15 

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gin 
20 25 ^ 30 

He Leu Gly Thr Val Ser Leu He Ala Val Asp Thr Gly Ser Leu Asn 
35 40 45 

Pro He Thr Gly Thr Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp 
50 55 60 

Ala Asn Gly Val Leu Gin Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp 
65 70 75 80 

Asn Phe Arg Gin Gly Asp Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala 
85 90 95 

He Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly 
100 105 ~ HO 

Ala Ala Lys Ser His He Val Gly Lys Val Tyr Leu His Gly Asp Thr 
115 120 * 125 

Gly Lys Pro Leu Asp Leu He He Thr Phe Asn Glu Thr Ser Asp Glu 
130 135 140 

Ser Cys Thr Tyr Cys He Asn Phe Gin Trp Gin Trp Gly Ala Asp Gin 
145 150 155 * 160 

Tyr Lys Asn Glu Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr He 
165 170 175 



112 



WO 03/046124 



PCTYUS02/33645 



Ala Lys Glu 



<210> 22 
<211> 183 
<212> PRT 

<213> human adenovirus Ad 2 
<400> 22 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg lie His Ser 
15 10 15 

Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 25 ^ 30 

Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser 
35 40 45 

Met Thr Gly Thr Val Ala Ser Val Ser He Phe Leu Arg Phe Asp Gin 
50 55 60 

Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn 
65 70 75 - 80 

Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val 
85 90 95 

Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gin Ser Gin Thr 
100 105 * 110 

Ala Lys Asn Asn He Val Ser Gin Val Tyr Leu His Gly Asp Lys Thr 
115 120 125 

Lys Pro Met He Leu Thr He Thr Leu Asn Gly Thr Ser Glu Ser Thr 
130 135 140 

Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp 
145 150 155 160 

Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr 
165 170 175 

Phe Ser Tyr He Ala Gin Glu 
180 

<210> 23 
<211> 182 
<212> PRT 

<213> human adenovirus Ad 5 
<400> 23 

Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala 
15 10 15 
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Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin 
20 25 30 

lie Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro 
35 40 45 

He Ser Gly Thr Val Gin Ser Ala His Leu lie He Arg Phe Asp Glu 
50 55 60 

Asn Gly Val Leu He Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn 
65 ~ 70 75 80 

Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val 
85 90 95 

Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro' Lys Ser His Gly Lys Thr 
100 105 110 

Ala Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr 
115 120 125 

Lys Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly 
130 135 140 

Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser 
145 150 155 160 

Gly His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Glu 
165 170 175 



Ser Tyr lie Ala Gin Glu 
180 



<210> 24 

<211> 34264 

<212> DNA 

<213> simian adenovirus SV-1 

<220> 

<221> CDS 

<222> (12454) • . (13965) 

<223> L2 Pent on 

<220> 

<221> CDS 

<222> (16841) (19636) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (28059) . . (29150) 

<223> L5 Fiber #2 
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<220> 

<221> CDS 

<222> (29183) (30865) 
<223> L5 Fiber #1 

<400> 24 

tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60 

gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120 

gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180 

tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240 

atatgtactt tttatgactg ggcggaattt ctgctgatca gcagtgaact ttgggcgctg 300 

acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360 

ctcctcagcg ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420 

gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480 

acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acagatctgc 540 

tggatgttcc caactcgcct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600 

aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtatgttc cctgaatgtc 660 

tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720 

tggacttgaa atgctacgag gaatgtctgc cttctagcga ttctgaaacg gaacagacag 780 

ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840 

ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat cctgagactc 900 

tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960 

tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttattatg tgtcattatt 1020 

tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080 

ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140 

agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200 

tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260 

tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320 

gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380 

caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440 

ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500 
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cactgttaat ataaaaaaga gtgtttatgt tatcggccat ggagctctgg tgcaggtgca 1560 

gaccccagac cgggtggctt tcaattgcgg catgcagagt ttgggccccg gggtgatagg 1620 

tttgaatgga gttacatttc aaaatgtcag gtttactggt gatgatttta atggctctgt 1680 

gtttgtgact agcacccagc taaccctcca cggtgtttac ttttttaact ttaacaatac 1740 

atgtgtggag tcatggggta gggtgtctct gaggggctgc agttttcatg gttgctggaa 1800 

ggcggtggtg ggaagaatta aaagtgtcat gtctgtgaag aaatgcatat ttgaacgctg 1860 

tgtgatagct ctagcagtag aggggtacgg acggatcagg aataacgccg catctgagaa 1920 

tggatgtttt cttttgctga aaggtacggc cagcgttaag cataatatga tttgcggcag 1980 

cggcctgtgc ccctcgcagc tcttaacttg cgcagatgga aactgtcaca ccttgcgcac. 2040 

cgtgcacata gtgtcccact cgcgccgcac ctggccaaca tttgagcaca atatgctcat 2100 

gcgttgcgcc gttcacctag gtgctagacg cggcgtgttt atgccttatc aatgtaactt 2160 

tagtcatact aagattttgc tggaaactga ttccttccct cgagtatgtt tcaatggggt 2220 

gtttgacatg tcaatggaac tttttaaagt gataagatat gatgaaacca agtctcgttg 2280 

tcgctcatgt gaatgcggag ctaatcattt gaggttgtat cctgtaaccc tgaacgttac 2340 

cgaggagctg aggacggacc accacatgct gtcttgcctg cgtaccgact atgaatccag 2400 

cgatgaggag tgaggtgagg ggcggagcca caaagggtat aaaggggcat gaggggtggg 2460 

cgcggtgttt caaaatgagc gggacgacgg acggcaatgc gtttgagggg ggagtgttca 2520 

gcccatatct gacatctcgt cttccttcct gggcaggagt tcgtcagaat gtagtgggct 2580 

ccaccgtgga cggacggccg gtcgcccctg caaattccgc caccctcacc tatgccaccg 2640 

tgggatcatc gttggacact gccgcggcag ctgccgcttc tgctgccgct tctactgctc 2700 

gcggcatggc ggctgatttt ggactatata accaactggc cactgcagct gtggcgtctc 2760 

ggtctctggt tcaagaagat gccctgaatg tgatcttgac tcgcctggag atcatgtcac 2820 

gtcgcctgga cgaactggct gcgcagatat cccaagctaa ccccgatacc gcttcagaat 2880 

cttaaaataa agacaaacaa atttgttgaa aagtaaaatg gctttatttg ttttttttgg 2940 

ctcggtaggc tcgggtccac ctgtctcggt cgttaaggac tttgtgtatg ttttccaaaa 3000 

cacggtacag atgggcttgg atgttcaagt acatgggcat gaggccatct ttggggtgga 3060 

gataggacca ctgaagagcg tcatgttccg gggtggtatt gtaaatcacc cagtcgtagc 3120 

agggtttttg agcgtggaac tggaatatgt ccttcaggag caggctaatg gccaagggta 3180 

gacccttagt gtaggtgttt acaaagcggt tgagctggga gggatgcatg cggggggaga 3240 
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tgatatgcat 


cttggcttgg 


attttgaggt 


tagctatgtt 


accacccagg 


tctctgcggg 


3300 


ggttcatgtt 


atgaaggacc 


accagcacgg 


tatagccagt 


gcatttgggg 


aacttgtcat 


3360 


gcagtttgga 


ggggaaggcg 


tggaagaatt 


tagatacccc 


cttgtgcccc 


cctaggtttt 


3420 


ccatgcactc 


atccataata 


atggcaatgg 


gacccctggc 


ggccgcttta 


gcaaacacgt 


3480 


tttgggggtt 


ggaaacatca 


tagttttgct 


ctagagtgag 


ctcatcatag 


gccatcttta 


3540 


caaagcgggg 


taggagggtg 


cccgactggg 


ggatgatagt 


tccatctggg 


cctggagcgt 


3600 


agttgccctc 


acagatctgc 


atctcccagg 


ccttaatttc 


cgaggggggg 


atcatgtcca 


3660 


cctggggggc 


gataaaaaac 


acggtttctg 


gcggggggtt 


aatgagctgg 


gtggaaagca 


3720 


agttacgcaa 


cagctgggat 


ttgccgcaac 


cggtgggacc 


gtagatgacc 


ccgatgacgg 


3780 


gttgcagctg 


gtagttcaga 


gaggaacagc 


tgccgtcggg 


gcgcaggagg 


ggagctacct 


3840 


cattcatcat 


gcttctgaca 


tgtttatttt 


cactcactaa 


gttttgcaag 


agcctctccc 


3900 


cacccaggga 


taagagttct 


tccaggctgt 


tgaagtgttt 


cagcggtttc 


aggccgtcgg 


3960 


ccatgggcat 


cttttcaagc 


gactgacgaa 


gcaagtacag 


tcggtcccag 


agctcggtga 


4020 


cgtgctctat 


ggaatctcga 


tccagcagac 


ttcttggttt 


cgggggttgg 


gccgactttc 


4080 


gctgtagggc 


accagccggt 


gggcgtccag 


ggccgcgagg 


gttctgtcct 


tccagggtct 


4140 


cagcgttcgg 


gtgagggtgg 


tctcggtgac 


ggtgaaggga 


tgagccccgg 


gctgggcgct 


4200 


tgcgagggtg 


cgcttcaggc 


tcatcctgct 


ggtgctgaag 


cgggcgtcgt 


ctccctgtga 


4260 


gtcggccaga 


tagcaacgaa 


gcatgaggtc 


gtagctgagg 


gactcggccg 


cgtgtccctt 


4320 


ggcgcgcagc 


tttcccttgg 


aaacgtgctg 


acatttggtg 


cagtgcagac 


acttgagggc 


4380 


gtagagtttt 


ggggccagga 


agaccgactc 


gggcgagtag 


gcgtcggctc 


cgcactgagc 


4440 


gcagacggtc 


tcgcactcca 


ccagccacgt 


gagctcgggt 


ttagcgggat 


caaaaaccaa 


4500 


gttgcctcca 


ttttttttga 


tgcgtttctt 


accttgcgtc 


tccatgagtc 


tgtgtcccgc 


4560 


ttccgtgaca 


aaaaggctgt 


cggtatcccc 


gtagaccgac 


ttgagggggc 


gatcttccaa 


4620 


aggtgttccg 


aggtcttccg 


cgtacaggaa 


ctgggaccac 


tccgagacaa 


aggctcgggt 


4680 


ccaggctaac 


acgaaggagg 


cgatctgcga 


ggggtatctg 


tcgttttcaa 


tgagggggtc 


4740 


caccttttcc 


agggtgtgca 


gacacaggtc 


gtcctcctcc 


gcgtccacga 


aggtgattgg 


4800 


cttgtaagtg 


taggtcacgt 


gacccgcacc 


cccccaaggg 


gtataaaagg 


gggcgtgccc 


4860 


actctccccg 


tcactttctt 


ccgcatcgct 


gtggaccaga 


gccagctgtt 


cgggtgagta 


4920 
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ggccctctca aaagccggca tgatttcggc gctcaagttg tcagtttcta caaacgaggt 4980 

ggatttgata ttcacgtgcc ccgcggcgat gcttttgatg gtggaggggt ccatctgatc 5040 

agaaaacacg atctttttat tgtcaagttt ggtggcgaaa gacccgtaga gggcgttgga 5100 

aagcaacttg gcgatggagc gcagggtctg atttttctcc cgatcggccc tctccttggc 5160 

ggcgatgttg agttgcacgt actcgcgggc cacgcaccgc cactcgggga acacggcggt 5220 

gcgctcgtcg ggcaggatgc gcacgcgcca gccgcggttg tgcagggtga tgaggtccac 5280 

gctggtggcc acctccccgc ggaggggctc gttggtccaa cacaatcgcc ccccttttct 5340 

ggagcagaac ggaggcaggg gatctagcaa gttggcgggc ggggggtcgg cgtcgatggt 5400 

aaatatgccg ggtagcagaa ttttattaaa ataatcgatt tcggtgtccg tgtcttgcaa 5460 

cgcgtcttcc cacttcttca ccgccagggc cctttcgtag ggattcaggg gcggtcccca 5520 

gggcatgggg tgggtcaggg ccgaggcgta catgccgcag atgtcgtaca cgtacagggg 5580 

ctccctcaac accccgatgt aagtggggta acagcgcccc ccgcggatgc tggctcgcac 5640 

gtagtcgtac atctcgtgag agggagccat gagcccgtct cccaagtggg tcttgtgggg 5700 

tttttcggcc cggtagagga tctgcctgaa gatggcgtgg gagttggaag agatagtggg 5760 

gcgttggaag acgttaaagt tggctccggg cagtcccacg gagtcttgga tgaactgggc 5820 

gtaggattcc cggagcttgt ccaccagggc tgcggttacc agcacgtcga gagcgcagta 5880 

gtccaacgtc tcgcggacca ggttgtaggc cgtctcttgt tttttctccc acagttcgcg 5940 
attgaggagg tattcctcgc ggtctttcca gtactcttcg gcgggaaatc ctttttcgtc 
cgctcggtaa gaacctaaca tgtaaaattc gttcacggct ttgtatggac aacagccttt 

ttctaccggc agggcgtacg cttgagcggc ctttctgaga gaggtgtggg tgagggcgaa 6120 

ggtgtcccgc accatcactt tcaggtactg atgtttgaag tccgtgtcgt cgcaggcgcc 6180 

ctgttcccac agcgtgaagt cggtgcgctt tttctgcctg ggattgggga gggcgaatgt 6240 

gacgtcgtta aagaggattt tcccggcgcg gggcatgaag ttgcgagaga tcctgaaggg 6300 

tccgggcacg tccgagcggt tgttgatgac ttgcgccgcc aggacgatct cgtcgaagcc 6360 

gttgatgttg tggcccacga tgtaaagttc gataaagcgc ggctgtccct tgagggccgg 6420 

cgcttttttc aactcctcgt aggtgagaca gtccggcgag gagagaccca gctccgcccg 6480 

ggcccagtcg gagagctgag ggttagccgc gaggaaagag ctccacaggt caagggctag 6540 

cagagtttgc aagcggtcgc ggaactcgcg aaactttttc cccacggcca ttttctccgg 6600 

cgtcaccacg tagaaagtgc aggggcggtc gttccagacg tcccatcgga gctctagggc 6660 



6000 
6060 
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cagctcgcag 


gcttgacgaa cgagggtctc ctcgcccgag acgtgcatga 


ccagcatgaa 


6720 


gggtaccaac 


tgtttcccga acgagcccat ccatgtgtag gtttctacgt 


cgtaggtgac 


6780 


aaagagccgc 


tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct 


cctgccacca 


6840 


gttggaggaa 


tgggtgttga tgtggtgaaa gtagaagtcc cgccggcgca 


cagagcattc 


6900 


gtgctgatgt 


ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct 


gtatctcctg 


6960 


aatgagatgc 


gcttttcgcc cgcgcaccag aaaccggagg gggaagttga 


gacgggggct 


7020 


tggtggggcg 


gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc 


cctccttctc 


7080 


tgggtggacg 


acggtgggga cgacgacgcc ccgggtgccg caagtccaga 


tctccgccac 


7140 


ggaggggcgc 


aggcgttgca ggaggggacg cagctgcccg ctgtccaggg 


agtcgagggc 


7200 


ggccgcgctg 


aggtcggcgg gaagcgtttg caagttcact ttcagaagac 


cggtaagagc 


7260 


gtgagccagg 


tgcacatggt acttgatttc caggggggtg ttggaagagg 


cgtccacggc 


7320 


gtagaggagg 


ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt 


ttatctcact 


7380 


cgtcgagggc 


gagcgccggg gggtagaggc ggctctgcgc cggggggcag 


cggaggcagt 


7440 


ggcacgtttt 


cgtgaggatt cggcagcggt tgatgacgag cccggagact 


gctggcgtgg 


7500 


gcgacgacgc 


ggcggttgag gtcctggatg tgccgtctct gcgtgaagac 


caccggcccc 


7560 


cgggtcctga 


acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt 


aacggcggcc 


7620 


tgcctgagga 


tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc 


ggccatgaac 


7680 


tgctccactt 


cttcctcgcg gaggtcgccg tggcccgctc gctccacggt 


ggcggccagg 


7740 


tcgttggaga 


tgcgacgcat gagttgagag aaggcgttga ggccgttctc 


gttccacacg 


7800 


cggctgtaca 


ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg 


ggccacgttg 


7860 


agttccacgt 


ggcgggcgaa gacggcgtag tttctgaggc gctggaagag 


gtagttgagc 


7920 


gtggtggcga 


tgtgctcgca gacgaagaag tacatgatcc agcgccgcag 


ggtcatctcg 


7980 


ttgatgtctc 


cgatggcttc gagacgctcc atggcctcgt agaagtcgac 


ggcgaagttg 


8040 


aaaaattggg 


agttgcgggc ggccaccgtg agttcttctt gcaggaggcg 


gatgagatcg 


8100 


gcgaccgtgt 


cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc 


ttcttcctcc 


8160 


ggctcctcct 


cttccagggg cacgggttcc tccggcagct ctgcgacggg 


gacggggcgg 


8220 


cgacgtcgtc 


gtctgaccgg caggcggtcc acgaagcgct cgatcatttc 


gccgcgccgg 


8280 


cgacgcatgg 


tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc 


gaagacgccg 


8340 
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ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 

gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 

tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 8520 

ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 

ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaccacg 8640 

tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700 

cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760 

cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 

gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880 

tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca gttggccatg 8940 

acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt gaggcgcgag 9000 

taggcgcggg actcgaacac gtagtcgttg catgtgcgta ccagatactg gtagccaacc 9060 

aggaagtggg gaggcggttc tcggtacagg ggccagccga ctgtggcggg ggcgccgggg 9120 

gacaggtcgt ccagcatgag gcgatggtag tggtagatgt agcgggagag ccaggtgatg 9180 

ccggccgagg tggtcgcggc cctggtgaat tcgcggacgc ggttccagat gttgcgcagg 9240 

gggcgaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca atcttgtacg 9300 

ctctagatgg aaaaaagaca gggcggtcat cgactccctt ccgtagctcg gggggtaaag 9360 

tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg ccgctcccga 9420 

tgcgcctggc cccgcatcca cgacgtccgc gtcgagaccc agccgcgacg ctccgcccca 9480 

atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc ggcagatgcg 9540 

acctcagacg cccaccacca ccgccgcggc ggcagtaaac ctgagcggag gcggtgacag 9600 

ggaggaggag gagctggctt tagacctgga agagggagag gggctggccc ggctgggagc 9660 

gccgtcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc aggcttttgt 9720 

gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga tgcgcgattg 9780 

caggtttcgg gcgggtagag agctgagggc gggcttcgat cgggagcggc tcctgagggc 9840 

ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900 

ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960 

ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020 

tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080 
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gctgttcctg 


atcgtgcagc acagccgcga 


caacgagacg 


ttccgcgacg 


ccatgttgaa 


10140 


catcgcggag 


cccgagggtc gctggctctt 


ggatctgatt 


aacatcctgc 


agagcatcgt 


10200 


ggtgcaggag 


aggggcctca gcttagcgga 


caaggtggcg 


gccattaact 


attcgatgca 


10260 


gagcctgggg 


aagttctacg ctcgcaagat 


ctacaagagc 


ccttacgtgc 


ccatagacaa 


10320 


ggaggtgaag 


atagacagct tttacatgcg 


catggcgctg 


aaggtgctga 


cgctgagcga 


10380 


cgacctcggc 


gtgtaccgta acgacaagat 


ccacaaggcg 


gtgagcgcca 


gccgccggcg 


10440 


ggagctgagc 


gacagggagc tgatgcacag 


cctgcagagg 


gcgctggcgg 


gcgccgggga 


10500 


cgaggagcgc 


gaggcttact tcgacatggg 


agccgatctg 


cagtggcgtc 


ccagcgcgcg 


10560 


cgccttggag 


gcggcgggct accccgacga 


ggaggatcgg 


gacgatttgg 


aggaggcagg 


10620 


cgagtacgag 


gacgaagcct gaccgggcag 


gtgttgtttt 


agatgcagcg 


gccggcggac 


10680 


ggggccaccg 


cggatcccgc acttttggca 


tccatgcaga 


gtcaaccttc 


gggcgtgacc 


10740 


gcctccgatg 


actgggcggc ggccatggac 


cgcattatgg 


cgctgactac 


ccgcaacccc 


10800 


gaggctttta 


gacagcaacc ccaggccaac 


cgtttttcgg 


ccatcttgga 


agcggtggtg 


10860 


ccctcccgca 


ccaaccccac acacgagaaa 


gtcctgacta 


tcgtgaacgc 


cctggtagac 


10920 


agcaaggcca 


tccgccgcga cgaggcgggc 


ttgatttaca 


acgctctgct 


ggaacgggtg 


10980 


gcgcgctaca 


acagcactaa cgttcagacc 


aatctggatc 


gcctcaccac 


cgacgtgaag 


11040 


gaggcgctgg 


ctcagaagga gcggtttctg 


agggacagca 


atctgggctc 


tctggtggca 


11100 


ctcaacgcct 


tcctgagcac gcagccggcc 


aacgtgcccc 


gcgggcagga 


ggactacgtg 


11160 


agcttcatca 


gcgctctgag gctgctggtg 


tccgaggtgc 


cccagagcga 


ggtgtatcag 


11220 


tctgggccgg 


attacttctt ccagacgtcc 


cgacagggct 


tgcaaacggt 


gaacctgact 


11280 


caggccttta 


aaaacttgca aggcatgtgg 


ggcgttaagg 


ccccggtggg 


cgatcgagcc 


11340 


accatctcca 


gtctgctgac ccccaacact 


cgcctgctgc 


tgctcttgat 


cgcgccgttc 


11400 


accaacagta 


gcactatcag ccgtgactcg 


tacctgggtc 


atctcatcac 


tttgtaccgc 


11460 


gaggccatcg 


gtcaggctca gatcgacgag 


cacacatatc 


aggagatcac 


taacgtgagc 


11520 


cgggccctgg 


gtcaggaaga taccggcagc 


ctggaagcca 


cgttgaactt 


tttgctaacc 


11580 


aaccggaggc 


aaaaaatacc ctcccagttt 


acgttaagcg 


ccgaggagga 


gaggattctg 


11640 


cgatacgtgc 


agcagtccgt gagtctgtac 


ttgatgcggg 


agggcgccac 


cgcttccacg 


11700 


gctttagaca 


fcgacggctcg gaacatggaa 


ccgtcctttt 


actccgccca 


ccggccgttc 


11760 
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attaaccgtc 


tgatggacta 


cttccatcgc gcggccgcca 


tgaacgggga 


gtacttcacc 


11820 


aatgccatcc 


tgaatccgca 


ttggatgccc ccgtccggct 


tctacaccgg 


cgagtttgac 


11880 


ctgcccgaag 


ccgacgacgg 


ctttctttgg gacgacgtgt 


ccgacagcat 


tttcacgccg 


11940 


ggcaatcgcc 


gattccagaa 


gaaggagggc ggagacgagc 


tccccctctc 


cagcgtggag 


12000 


gcggcctcta 


ggggagagag 


tccctttccc agtctgtctt 


ccgccagcag 


tggtcgggta 


12060 


acgcgcccgc 


ggttgccggg 


ggagagcgac tacctgaacg 


accccttgct 


gcggccggct 


12120 


aggaagaaaa 


atttccccaa 


caacggggtg gaaagcttgg 


tggataaaat 


gaatcgttgg 


12180 


aagacctacg 


cccaggagca 


gcgggagtgg gaggacagtc 


agccgcgacc 


gctggttccg 


12240 


ccgcactggc 


gtcgtcagag 


agaagacccg gacgactccg 


cagacgatag 


tagcgtgttg 


12300 


gacctgggag 


ggagcggagc 


caaccccttt gctcacttgc 


aacccaaggg 


gcgttccagt 


12360 


cgcctctact 


aataaaaaag 


acgcggaaac ttaccagagc 


catggccaca 


gcgtgtgtcc 


12420 


tttcttcctc 


tctttcttcc 


tcggcgcggc aga atg aga aga gcg gtg aga gtc 


12474 



Met Arg Arg Ala Val Arg Val 
1 5 

acg ccg gcg gcg tat gag ggt ccg ccc cct tct tac gaa age gtg atg 
Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met 
10 15 20 

gga tea gcg aac gtg ccg gec acg ctg gag gcg cct tac gtt cct ccc 
Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr Val Pro Pro 
25 30 35 

aga tac ctg gga cct acg gag ggc aga aac age ate cgt tac tec gag 
Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser lie Arg Tyr Ser Glu 
40 45 50 55 

ctg gca ccc ctg tac gat acc acc aag gtg tac ctg gtg gac aac aag 
Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val Asp Asn Lys 
60 65 70 

teg gcg gac ate gec tec ctg aat tat caa aac gat cac age aat ttt 
Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe 
75 80 85 

ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg gag gcg ggc 
Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr Glu Ala Gly 
90 95 100 

acg cag acc att aac ttt gac gag cgt tec cgc tgg ggc ggt cag ctg 
Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gin Leu 
105 110 115 

aaa acc ate ctg cac acc aac atg ccc aac ate aac gag ttc atg tec 
Lys Thr He Leu His Thr Asn Met Pro Asn He Asn Glu Phe Met Ser 
120 125 130 135 



12522 



12570 



12618 



12666 



12714 



12762 



12810 



12858 
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acc aac aag ttc agg gcc agg ctg atg gtt aaa aag get gaa aac cag 12906 
Thr Asn Lys Phe Arg Ala Arg Leu Met Val Lys Lys Ala Glu Asn Gin 
140 145 150 

cct ccc gag tac gaa tgg ttt gag ttc acc att ccc gag ggc aac tat 12954 
Pro Pro Glu Tyr Glu Trp Phe Glu Phe Thr lie Pro Glu Gly Asn Tyr 
155 160 165 

tec gag acc atg act ate gat ctg atg aac aat gcg ate gtg gac aat 13002 
Ser Glu Thr Met Thr He Asp Leu Met Asn Asn Ala He Val Asp Asn 
170 175 180 

tac ctg caa gtg ggg agg cag aac ggg gta ttg gaa age gat ate ggc 13050 
Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly 
185 190 195 

gta aaa ttt gat acc aga aac ttc cga ctg ggg tgg gat ccc gtg acc 13098 
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr 
200 205 210 215 

aag ctg gtg atg cca ggc gtg tac acc aac gag get ttt cac ccc gac 13146 
Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp 
220 225 230 

ate gtg ctg ctg ccg ggg tgc ggt gtg gac ttc act cag age cgt ttg 13194 
He Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gin Ser Arg Leu 
235 240 245 

agt aac ctg tta ggg ate aga aag cgc cgc ccc ttc caa gag ggc ttt 13242 
Ser Asn Leu Leu Gly lie Arg Lys Arg Arg Pro Phe Gin Glu Gly Phe 
250 255 260 

cag ate atg tat gag gac ctg gaa gga ggt aac att cca ggt ttg eta 13290 
Gin He Met Tyr Glu Asp Leu Glu Gly Gly Asn He Pro Gly Leu Leu 
265 270 275 

gac gtg ccg gcg tat gaa gag agt gtt aaa cag gcg gag gcg cag gga 13338 
Asp Val Pro Ala Tyr Glu Glu Ser Val Lys Gin Ala Glu Ala Gin Gly 
280 285 . 290 295 

cga gag att cga ggc gac acc ttt gcc acg gaa cct cac gaa ctg gta 13386 
Arg Glu He Arg Gly Asp Thr Phe Ala Thr Glu Pro His Glu Leu Val 
300 305 310 

ata aaa cct ctg gaa caa gac agt aaa aaa egg agt tac aac att ata 13434 
He Lys Pro Leu Glu Gin Asp Ser Lys Lys Arg Ser Tyr Asn He He 
315 320 325 

tec ggc act atg aat acc ttg tac egg age tgg ttt ctg get tac aac 13482 
Ser Gly Thr Met Asn Thr Leu Tyr Arg Ser Trp Phe Leu Ala Tyr Asn 
330 335 340 

tac ggg gat ccc gaa aag gga gtg aga tea tgg acc ata etc acc acc 13530 
Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr He Leu Thr Thr 
345 350 355 
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acg gac gtg acc tgc ggc teg cag caa gtg tac tgg tec ctg ccg gat 13578 
Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr Trp Ser Leu Pro Asp 
360 ~ 365 ' 370 375 

atg atg caa gac ccg gtc acc ttc cgc ccc tec acc caa gtc age aac 13626 
Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser Thr Gin Val Ser Asn 
380 385 390 

ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc cat gee aag age ttc 13674 
Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val His Ala Lys Ser Phe 
395 400 405 

tac aac gaa cag gee gtc tac teg caa etc att cgc cag tec acc gcg 13722 
Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu lie Arg Gin Ser Thr Ala 
410 415 420 

ctt acc cac gtg ttc aat cgc ttt ccc gag aac cag att ctg gtg cgc 13770 
Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin lie Leu Val Arg 
425 430 435 

cct ccc get cct acc att acc acc gtc agt gaa aac gtt ccc gee etc 13818 
Pro Pro Ala Pro Thr lie Thr Thr Val Ser Glu Asn Val Pro Ala Leu 
440 445 450 455 

aca gat cac gga acc ctg ccg ctg cgc age agt ate agt gga gtt cag 13866 
Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser lie Ser Gly Val Gin 
460 465 470 

cgc gtg acc ate acc gac gee aga cgt cga acc tgt ccc tac gtt tac 13914 
Arg Val Thr lie Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr 
475 480 485 

aaa get ctt ggc gta gtg get cct aaa gtg etc tct agt cgc acc ttc 13962 
Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 
490 495 500 



taa acatgtccat cctcatctct cccgataaca acaccggctg gggactgggc 


14015 


teeggcaaga 


tgtaeggegg 


agceaaaagg 


cgctccagtc 


agcacccagt 


tegagttegg 


14075 


ggccacttcc 


gtgctccctg 


gggagcttac 


aagegaggae 


tctcgggccg 


aacggcggta 


14135 


gacgataeca 


tagatgeegt 


gattgecgae 


gcccgccggt 


acaaecccgg 


accggtcgct 


14195 


agcgccgcct 


ccaccgtgga 


ttccgtgatc 


gacagcgtgg 


tagctggege 


tegggectat 


14255 


gctcgccgca 


agaggeggct 


geateggaga 


cgtcgcccca 


ccgccgccat 


gctggcagcc 


14315 


agggccgtgc 


tgaggeggge 


eeggagggta 


ggcagaaggg 


ctatgcgccg 


cgctgccgcc 


14375 


aacgccgccg 


cegggaggge 


ccgccgacag 


gctgcccgcc 


aggctgetge 


cgccatcgct 


14435 


ageatggeca 


gacccaggag 


agggaaegtg 


tactgggtgc 


gcgattctgt 


gaegggagtc 


14495 


egagtgeegg 


tgcgcagccg 


acctccccga 


agttagaaga 


tccaagctgc 


gaagacggcg 


14555 


gtactgagtc 


tccctgttgt 


tatcagccca 


aeatgagcaa 


gcgcaagttt 


aaagaagaae 


14615 
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tgctgcagac 


gctggtgcct 


gagatctatg 


gccctccgga 


cgtgaagcct 


gacattaagc 


14675 


cccgcgatat 


caagcgtgtt 


aaaaagcggg 


aaaagaaaga 


ggaactcgcg 


gtggtagacg 


14735 


atggcggagt 


ggaatttatt 


aggagtttcg 


ccccgcgacg 


cagggttcaa 


tggaaagggc 


14795 


ggcgggtaca 


acgcgttttg 


aggccgggca 


ccgcggtagt 


ttttaccccg 


ggagagcggt 


14855 


cggccgttag 


gggtttcaaa 


aggcagtacg 


acgaggtgta 


cggcgacgag 


gacatattgg 


14915 


aacaggcggc 


tcaacagatc 


ggagaatttg 


cctacggaaa 


gcgttcgcgt 


cgcgaagacc 


14975 


tggccatcgc 


tttagacagc 


ggcaacccca 


cgcccagcct 


caaacctgtg 


acgctgcagc 


15035 


aggtgctccc 


cgtgagcgcc 


agcacggaca 


gcaagagggg 


aataaaaaga 


gaaatggaag 


15095 


atctgcagcc 


caccatccag 


ctcatggtcc 


ctaaacggca 


gaggctggaa 


gaggtcctgg 


15155 


agaaaatgaa 


agtggaccca 


agcatagagc 


cggacgtcaa 


agtcaggccg 


atcaaagaag 


15215 


tggcccctgg 


tctcggggtg 


cagacggtgg 


atatccagat 


ccccgtcacg 


tcagcttcga 


15275 


ccgccgtgga 


agccatggaa 


acgcaaacgg 


aaacccctgc 


cgcgatcggt 


accagggaag 


15335 


tggcgttgca 


aaccgacccc 


tggtacgaat 


acgccgcccc 


tcggcgtcag 


aggcgacccg 


15395 


ctcgttacgg 


ccccgccaac 


gccatcatgc 


cagaatatgc 


gctgcatccg 


tctatcctgc 


15455 


ccacccccgg 


ctaccgggga 


gtgacgtatc 


gcccgtcagg 


aacccgccgc 


cgaacccgtc 


15515 


gccgccgccg 


ctcccgtcgt 


gctctggccc 


ccgtgtcggt 


gcgccgcgta 


acacgccggg 


15575 


gaaagacagt 


taccattccc 


aacccgcgct 


accaccctag 


catcctttaa 


tgactctgcc 


15635 


gttttgcaga 


tggctctgac 


ttgccgcgtg 


cgccttcccg 


ttccgcacta 


tcgaggaaga 


15695 


tctcgtcgta 


ggagaggcat 


ggcgggtagt 


ggtcgccggc 


gggctttgcg 


caggcgcatg 


15755 


aaaggcggaa 


ttttacccgc 


tctgataccc 


ataatcgccg 


ccgccatcgg 


tgccataccc 


15815 


ggcgtcgctt 


cagtggcctt 


gcaagcagct 


cgtaataaat 


aaacgaaggc 


ttttgcactt 


15875 


atgtcctggt 


cctgactatt 


ttatgcagaa 


agagcatgga 


agacatcaat 


tttacgtcgc 


15935 


tggctccgcg 


gcacggctcg 


cggccgctca 


tgggcacctg 


gaacgacatc 


ggcaccagtc 


15995 


agctcaacgg 


gggcgctttc 


aattggggga 


gcctttggag 


cggcattaaa 


aactttggct 


16055 


ccacgattaa 


atcctacggc 


agcaaagcct 


ggaacagtag 


tgctggtcag 


atgctccgag 


16115 


ataaactgaa 


ggacaccaac 


ttccaagaaa 


aagtggtcaa 


tggggtggtg 


accggcatcc 


16175 


acggcgcggt 


agatctcgcc 


aaccaagcgg 


tgcagaaaga 


gattgacagg 


cgtttggaaa 


16235 


gctcgcgggt 


gccgccgcag 


agaggggatg 


aggtggaggt 


cgaggaagta 


gaagtagagg 


16295 
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aaaagctgcc 


cccgctggag 


aaagttcccg 


gtgcgcctcc 


gagacegcag 


aagcgaccca 


16355 


ggccagaact 


agaagaaact 


ctggtgacgg 


agagcaagga 


gcctccctcg 


tacgagcaag 


16415 


ccttgaaaga 


gggcgcctct 


ccaccctacc 


caatgacaaa 


accgatcgcg 


cctatggctc 


16475 


ggccggtgta 


cgggaaggac 


tacaagcctg 


teaegctaga 


gctccccccg 


ccgccaccgc 


16535 


cgccccccac 


gcgcccgacc 


gttccccccc 


ccctgccggc 


tccgtcggcg 


ggacccgtgt 


16595 


ccgcacccgt 


cgccgtgcct 


ctgccagccg 


cccgcccagt 


ggccgtggcc 


actgecagaa 


16655 


accccagagg 


ccagagagga 


gccaactggc 


aaagcacget 


gaacagcatc 


gtgggcctgg 


16715 


gagtgaaaag 


cctgaaacgc 


cgccgttgct 


attattaaaa 


gtgtagctaa 


aaaatttccc 


16775 


gttgtatacg 


cctcctatgt 


taccgccaga 


gaegegtgae 


tgtcgccgcg 


agcgccgctt 


16835 


tcaag atg gcc acc cca 
Met Ala Thr Pro 


teg atg atg ccg cag tgg tct tac atg cac ate 
Ser Met Met Pro Gin Trp Ser Tyr Met His He 


16885 



505 



510 



515 



gcc ggg cag gac gcc teg gag tac ctg age ccc ggt etc gtg cag ttc 
Ala Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe 
520 525 530 



16933 



gcc cgc gcc acc gac acc tac ttc age ttg gga aac aag ttt aga aac 
Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn 
535 540 545 550 



16981 



ccc acc gtg gcc ccc acc cac gat gta acc acg gac cgc teg caa agg 
Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg 
555 560 565 



17029 



ctg acc ctg cgt ttt gtg ccc gta gac egg gag gac acc gcg tac tct 
Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser 
570 575 580 



17077 



tac aaa gtg cgc tac acg ctg gcc gta ggg gac aac cga gtg ctg gac 
Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp 
585 " 590 595 



17125 



atg gcc age acc tac ttt gac ate egg gga gtg ctg gat cgc ggt ccc 
Met Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro 
600 605 610 



17173 



agt ttt aag ccc tac teg ggt acc gcg tac aat tec ctg get ccc aag 
Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys 
615 620 625 630 



17221 



ggc get ccc aac cct gca gaa tgg acg aat tea gac age aaa gtt aaa 
Gly Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys 
635 * 640 " 645 



17269 



gtg agg gca cag gcg cct ttt gtt age teg tat ggt get aca gcg att 
Val Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala He 
650 655 660 



17317 
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aca aaa gag ggt att cag gtg gga gta acc tta aca gac tec gga tea 17365 
Thr Lys Glu Gly lie Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser 
665 670 675 

aca cca cag tat gca gat aaa acg tat cag cct gag ccg caa att gga 17413 
Thr Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin lie Gly 
680 685 690 

gaa eta cag tgg aac age gat gtt gga acc gat gac aaa ata gca gga 17461 
Glu Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys lie Ala Gly 
695 700 . 705 710 

aga gtg eta aag aaa aca acg ccc atg ttc cct tgt tac ggc tea tat 17509 
Arg Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr 
715 720 " "* 725 

gec agg ccc act aat gaa aaa gga gga cag gca aca ccg tec get agt 17557 
Ala Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser 
730 735 740 

caa gac gtg caa aat ccc gaa tta caa ttt ttt gec tct act aat gtc 17605 
Gin Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val 
745 750 755 

gee aat aca cca aaa gca gtt eta tat gcg gag gac gtg tea att gaa 17653 
Ala Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser He Glu 
760 765 770 

gcg cca gac act cac ttg gtg ttc aaa cca aca gtc act gaa ggc att 17701 
Ala Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly He 
775 780 • 785 790 

aca agt tea gag get eta ctg acc caa caa get get ccc aac cgt cca 17749 
Thr Ser . Ser Glu Ala Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro 
795 800 805 



aac tac ata gee ttt aga gat aat ttt att ggt etc atg tac tac aat 
Asn Tyr He Ala Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn 
810 815 ~ 820 



17797 



age aca ggt aac atg gga gta ctg gca ggc cag get tct cag eta aat 17845 
Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn 
825 830 835 

gca gtt gtt gac ctg caa gac aga aat act gag ctg tec tac caa etc 17893 
Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu 
840 845 850 

atg ttg gac gee etc gga gac cgc agt egg tac ttt tct atg tgg aac 17941 
Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn 
855 860 865 870 

caa get gtg gat agt tac gat cct gat gta aga ate ata gaa aac cat 17989 
Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His 
875 880 885 
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ggc gta gaa gat gaa ttg cct aat tat tgc ttt cct ttg gga ggc atg 
Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met 
890 895 900 



18037 



gca gta acc gac acc tac teg cct ata aag gtt aat gga gga ggc aat 
Ala Val Thr Asp Thr Tyr Ser Pro lie Lys Val Asn Gly Gly Gly Asn 
905 910 915 



18085 



gga tgg gaa gec aat aac ggc gtt ttc acc gaa aga gga gtg gaa ata 
Gly Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu He 
920 925 930 



18133 



ggt tea ggg aac atg ttt gec atg gag att aac ctg caa gec aac eta 
Gly Ser Gly Asn Met Phe Ala Met Glu He Asn Leu Gin Ala Asn Leu 
935 940 945 950 



18181 



tgg cgt age ttt ctg tac tec aat att ggg ctg tac ctg cca gac tct 
Trp Arg Ser Phe Leu Tyr Ser Asn He Gly Leu Tyr Leu Pro Asp Ser 
955 960 965 



18229 



etc aaa ate act cct gac aac ate aca etc cca gag aac aaa aac acc 
Leu Lys He Thr Pro Asp Asn He Thr Leu Pro Glu Asn Lys Asn Thr 
970 975 980 



18277 



tat cag tat atg aac ggt cgc gtg acg cca ccc ggg ctg gtt gac acc 
Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr 
985 990 995 



18325 



tac gtt aac gtg ggc gcg cgc tgg tec ccc gat gtc atg gac agt 
Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser 
1000 1005 1010 



18370 



att aac cct ttt aat cac cac cgc aac gee gga etc cgc tac cgt 
He Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
1015 1020 1025 



18415 



tec atg etc ctg gga aac gga cgc tac gtg ccc ttc cac ate cag 
Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin 
1030 1035 1040 



18460 



gtg ccc cag aaa ttc ttt gca att aaa aac ctg ctg ctg etc ccc 
Val Pro Gin Lys Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro 
1045 1050 1055 



18505 



ggt tec tac acc tac gag tgg aac ttc cgc aag gac gtg aac atg 
Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 
1060 1065 1070 



18550 



ate ttg cag age teg ctg ggc aat gac ctg cga gtg gac ggg gee 
He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala 
1075 1080 1085 



18595 



age ate cgc ttc gac age ate aac ctg tac gec aac 
Ser He Arg Phe Asp Ser He Asn Leu Tyr Ala Asn 
1090 1095 1100 



ttt ttc ccc 
Phe Phe Pro 



18640 
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atg gcc cac aac acg gcc tec acc ctg gaa gec atg ctg cgc aac 
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn 
1105 1110 1115 



18685 



gac acc aac gac caa tct ttc aac gac tac ctg tgc gcg gcc aac 
Asp Thr Asn Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn 
1120 1125 1130 



18730 



atg ctg tac ccc ate ccc gcc aac gcc acc age gtg ccc ate tec 
Met Leu Tyr Pro lie Pro Ala Asn Ala Thr Ser Val Pro lie Ser 
1135 1140 1145 



18775 



att ccc tct cgc. aac tgg .gca gcc ttc agg ggc tgg agt ttc acc 
He Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr 
1150 1155 1160 



18820 



cgc etc aaa acc aag gag acc ccc teg ctg ggc tec 
Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser 
1165 1170 1175 



ggg ttc gac 
Gly Phe Asp 



18865 



ccc tac ttc gtc tac tec ggc tec ate ccc tac ctg gac ggc acc 
Pro Tyr Phe Val Tyr Ser Gly Ser He Pro Tyr Leu Asp Gly Thr 
1180 1185 1190 



18910 



ttc tac etc aac cat act ttc aaa aag gtg tea ate atg ttc gac 
Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He Met Phe Asp 
1195 1200 1205 



18955 



tec tec gtc age tgg ccc ggc aac gac cgt ctg ctg acg ccc aac 
Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 
1210 1215 1220 



19000 



gag ttc gaa ate aag cgt teg gtg gac ggt gaa ggg tac aac gtg 
Glu Phe Glu He Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val 
1225 ~ 1230 1235 



19045 



get cag age aac atg acc aag gac tgg ttc ctg att cag atg etc 
Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu He Gin Met Leu 
1240 1245 1250 



19090 



age cac tac aac ate ggc tac cag ggc ttc tac gtg ccc gaa aat 
Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn 
1255 1260 1265 



19135 



tac aag gac cgc atg tac tct ttc ttc aga aac ttc caa ccc atg 
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met 
1270 1275 ^ 1280 



19180 



age cgc caa att gta gat tea acg get tac act aat tat cag gat 
Ser Arg Gin He Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gin Asp 
1285 1290 1295 



19225 



gtg aaa ctg cca tac cag cat aac aac tea ggg ttc gtg ggc tac 
Val Lys Leu Pro Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr 
1300 1305 1310 



19270 
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atg gga ccc acc atg cga gag ggg cag gcc tac ccg gcc aac tat 
Met Gly Pro Thr Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr 
1315 1320 1325 

ccc tat ccc ctg att ggg gcc acc gcc gtg ccc age etc acg cag 
Pro Tyr Pro Leu lie Gly Ala Thr Ala Val Pro Ser Leu Thr Gin 
1330 1335 1340 

aaa aag ttc etc tgc gac egg gtg atg tgg agg ate ccc ttc tct 
Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg He Pro Phe Ser 
1345 1350 1355 

age aac ttc atg tct atg ggc tec etc acc gac ctg ggg cag aac 
Ser Asn Phe Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn 
1360 1365 1370 

atg ctg tac gcc aac tec get cac gcc ttg gat atg acc ttt gag 
Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu 
1375 1380 1385 

gtg gat ccc atg gat gag ccc acg ctt etc tat gtt ctg ttt gaa 
Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu 
1390 1395 1400 

gtc ttc gac gtg gtg cgc ate cac cag ccg cac cgc ggc gtc ate 
Val Phe Asp Val Val Arg He His Gin Pro His Arg Gly Val He 
1405 1410 1415 

gag gcc gtc tac ctg cgc aca cct ttc tct gcc ggt aac gcc acc 
Glu Ala Val Tyr Leu Arg. Thr Pro Phe Ser Ala Gly Asn Ala Thr 
1420 ~ 1425 1430 

acc taa agaagecgat gggctccagc gaacaggagc tgeaggecat tgttcgcgac 
Thr 



19315 



19360 



19405 



19450 



19495 



19540 



19585 



19630 



19686 



ctgggctgcg 


ggccctactt 


tttgggcacc 


ttcgacaagc gttttccegg 


cttcatgtcc 


19746 


ccccacaagc 


cggcctgtgc 


categttaac 


acggccggac gggagacegg 


gggggftccac 


19806 


tggctcgcct 


tcgcctggaa 


cccgcgtaac 


cgcacctgct acctgttcga 


cccttttggt 


19866 


ttctccgacg 


aaaggctgaa 


gcagatctac 


cagttcgagt acgaggggct 


cctcaagcgc 


19926 


agegctctgg 


cctccacgcc 


cgaccactgc 


gtcaccctgg aaaagtccac 


ccaaacggtc 


19986 


caggggcccc 


tctcggccgc 


ctgcgggctc 


ttctgttgca tgtttttgca 


cgccttcgtg 


20046 


cactggcctc 


acacccccat 


ggatcacaac 


cccaccatgg atctgctcac 


cggagtgccc 


20106 


aacagcatgc 


ttcacagccc 


ccaggtcgcc 


cccaccctgc gccgtaacca 


ggaacacctg 


20166 


tategcttte 


tggggaaaca 


ctctgcctat 


tttcgccgcc accggcagcg 


catcgaaegg 


20226 


gccacggcct 


tcgaaagcat 


gagecaaaga 


gtgtaatcaa taaaaaacat 


ttttatttga 


20286 


catgataege 


gcttctggcg 


ttttattaaa 


aatcgaaggg ttcgagggag 


gggtcctcgt 


20346 
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gcccgctggg 


gagggacacg 


ttgcgatact 


ggaaacgggc gctccaacga 


aactcgggga 


20406 


tcaccagccg 


cggcaggggc 


acgtcttcta 


ggttctgctt ccaaaactgc 


cgcaccagct 


20466 


gcagggctcc 


catgacgtcg 


ggcgccgata 


tcttgaagtc gcagttaggg 


ccggagctcc 


20526 


cgcggctgtt 


gcggaacacg 


gggttggcac 


actggaacac cagcacgccg 


gggttgtgga 


20586 


tactggccag 


ggccgtcggg 


tcggtcacct 


ccgacgcatc cagatcctcg 


gcgttgctca 


20646 


gggcaaacgg 


ggtcagcttg 


cacatctgcc 


gcccaatctg- gggtactagg 


tcgcgcttgt 


20706 


tgaggcagtc 


gcagcgcaga 


gggatcagga 


tgcgtcgctg cccgcgttgc 


atgatagggt 


20766 


aactcgccgc 


caggaactcc 


tccatttgac 


ggaaggccat ctgggctttg 


ccgccctcgg 


20826 


tgtagaatag 


cccgcaggac 


ttgctagaga 


atacgttatg accgcagttg 


acgtcctccg 


20886 


cgcagcagcg 


ggcgtcttcg 


ttcttcagct 


gaaccacgtt gcggccccaa 


cggttctgga 


20946 


ccaccttggc 


tctagtgggg 


tgctccttca 


gcgcccgctg tccgttctcg 


ctggttacat 


21006 


ccatttccaa 


cacgtgctcc 


ttgcagacca 


tctccactcc gtggaagcaa 


aacaggacgc 


21066 


cctcctgctg 


ggtactgcga 


tgctcccata 


cggcgcatcc ggtgggctcc 


cagctcttgt 


21126 


gttttacccc 


cgcgtaggct 


tccatgtaag 


ccataaggaa tctgcccatc 


agctcggtga 


21186 


aggtcttctg 


gttggtgaag 


gttagcggca 


ggccgcggtg ctcctcgttc 


aaccaagttt 


21246 


gacagatctt 


gcggtacacc 


gctccctggt 


cgggcagaaa cttaaaagcc 


gctctgctgt 


21306 


cgttgtctac 


gtggaacttc 


tccattaaca 


tcatcatggt ttccataccc 


ttctcccacg 


21366 


ctgtcaccag 


tggtttgctg 


tcggggttct 


tcaccaacac ggcggtagag 


gggccctcgc 


21426 


cggccccgac 


gtccttcatg 


gtcattcttt 


gaaactccac ggagccgtcc 


gcgcgacgta 


21486 


ctctgcgcac 


cggagggtag 


ctgaagccca 


cctccaccac ggtgccttcg 


ccctcgctgt 


21546 


cggagacaat 


ctccggggat 


ggcggcggcg' 


cgggtgtcgc cttgcgagcc 


ttcttcttgg 


21606 


gagggagctg 


aggcgcctcc 


tgctcgcgct 


cggggctcat ctcccgcaag 


tagggggtaa 


21666 


tggagctgcc 


tgcttggttc 


tgacggttgg 


ccattgtatc ctaggcagaa 


agacatggag 


21726 


cttatgcgcg 


aggaaacttt 


aaccgccccg 


tcccccgtca gcgacgaaga 


tgtcatcgtc 


21786 


gaacaggacc 


cgggctacgt 


tacgccgccc 


gaggatctgg aggggcctga 


ccggcgcgac 


21846 


gctagtgagc 


ggcaggaaaa 


tgagaaagag 


gaggcctgct acctcctgga 


aggcgacgtt 


21906 


ttgctaaagc 


atttcgccag 


gcagagcacc 


atagttaagg aggccttgca 


agaccgctcc 


21966 


gaggtgccct 


tggacgtcgc 


cgcgctctcc 


caggcctacg aggcgaacct 


tttctcgcct 


22026 



131 



WO 03/046124 



PCT/US02/33645 



cgagtgcctc 


cgaagagaca gcccaacggc 


acctgcgagc 


ccaacccgcg 


actcaacttc 


22086 


taccccgtgt 


tcgccgtacc aga'ggcgctg 


gccacctatc 


acattttttt 


caaaaaccaa 


22146 


cgcatccccc 


tatcgtgccg ggccaaccgc 


accgcggccg 


ataggaatct 


caggcttaaa 


22206 


aacggagcca 


acatacctga tatcacgtcg 


ctggaggaag 


tgcccaagat 


tttcgagggt 


22266 


ctgggtcgag 


atgagaagcg ggcggcgaac 


gctctgcaga 


aagaacagaa 


agagagtcag 


22326 


aacgtgctgg 


tggagctgga gggggacaac 


gcgcgtctgg 


ccgtcctcaa 


acgctgcata 


22386 


gaagtctccc 


acttcgccta ccccgccctc 


aacttgccac 


ccaaagttat 


gaaatcggtc 


22446 


atggatcagc 


tgctcatcaa gagagctgag 


cccctggatc 


ccgaccaccc 


cgaggcggaa 


22506 


aactcagagg 


acggaaagcc cgtcgtcagc 


gacgaggagc 


tcgagcggtg 


gctggaaacc 


22566 


agggaccccc 


aacagttgca agagaggcgc 


aagatgatga 


tggcggccgt 


gctggtcacc 


22626 


gtggagctgg 


aatgcctgca acggtttttc 


agcgacgtgg 


agacgctacg 


caaaatcggg 


22686 


gaatccctgc 


actacacctt ccgccagggc 


tacgtccgcc 


aggcctgcaa 


gatctccaac 


22746 


gtggagctca 


gcaacctggt ctcctacatg 


ggcatcctcc 


acgagaaccg 


gctggggcag 


22806 


agcgtgctgc 


actgcacctt gcaaggcgag 


gcgcggcggg 


actacgtgcg 


agactgcatc 


22866 


tacctcttcc 


tcaccctcac ctggcagacc 


gccatgggcg 


tctggcagca 


gtgcttggaa 


22926 


gagagaaacc 


tcaaagagct agacaaactc 


ctctgccgcc 


agcggcgcgc 


cctgtggtcc 


22986 


ggtttcagcg 


agcgcacggt cgccagcgct 


ctggcggaca 


tcatcttccc 


ggagcgcctg 


23046 


atgaaaacct 


tgcaaaacgg cctgccggat 


ttcatcagtc 


aaagcatttt 


gcaaaacttc 


23106 


cgctcttttg 


tcctggaacg ctccgggatc 


ttgcccgcca 


tgagctgcgc 


gctaccttct 


23166 


gactttgtcc 


ccctctccta ccgcgagtgc 


cctcccccac 


tgtggagcca 


ctgctacctc 


23226 


ttccaactgg 


ccaactttct ggcctaccac 


tccgacctca 


tggaagacgt 


aagcggagag 


23286 


ggtttactgg 


agtgccactg ccgctgcaac 


ctgtgcaccc 


cccacagatc 


gctggcctgc 


23346 


aacaccgagc 


tactcagcga aacccaggtc 


ataggtacct 


tcgagatcca 


ggggccccag 


23406 


cagcaagagg 


gtgcttccgg cttgaagctc 


actccggcgc 


tgtggacctc 


ggcttactta 


23466 


cgcaaatttg 


tagccgagga ctaccacgcc 


cacaaaattc 


agttttacga 


agaccaatct 


23526 


cgaccaccga 


aagcccccct cacggcctgc 


gtcatcaccc 


agagcaagat 


cctggcccaa 


23586 


ttgcaatcca 


ucaaccaagc gcgccgcgau 




aaaa^^ UCy 


rr n net fit rr t a n 
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23646 


ctggaccccc 


agaccggcga ggaactcaac 


ccgtccacac 


tctccgtcga 


agcagccccc 


23706 


ccgagacatg 


ccgcccaagg gaaccgccaa 


gcagctgatc 


gctcggcaga 


gagegaagaa 


23766 
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gcaagagctg 


ctccagcagc 


aggtggagga 


cgaggaagag 


atgtgggaca 


gccaggcaga 


23826 


ggaggtgtca 


gaggacgagg 


aggagatgga 


aagctgggac 


agcctagacg 


aggaggagga 


23886 


cgagctttca 


gaggaagagg 


cgaccgaaga 


aaaaccacct 


gcatccagcg 


cgccttctct 


23946 


gagccgacag 


ccgaagcccc 


ggcccccgac 


gcccccggcc 


ggctcactca 


aagccagccg 


24006 


taggtgggac 


gccaccgaat 


ctccagcggc 


agcggcaacg 


gcagcgggta 


aggccaaacg 


24066 


cgagcggcgg 


gggtattgct 


cctggcgggc 


ccacaaaagc 


agtattgtga 


actgcttgca 


24126 


acactgcggg 


ggaaacatct 


cctttgcccg 


acgctacctc 


ctcttccatc 


acggtgtggc 


24186 


cttccctcgc 


aacgttctct 


attattaccg 


tcatctctac 


agcccctacg 


aaacgctcgg 


24246 


agaaaaaagc 


taaggcctcc 


tccgccgcga 


ggaaaaactc 


cgccgccgct 


gccgccgcca 


24306 


aggatccacc 


ggccaccgaa 


gagctgagaa 


agcgcatctt 


tcccactctg 


tatgctatct 


24366 


ttcagcaaag 


ccgcgggcag 


caccctcagc 


gcgaactgaa 


aataaaaaac 


cgctccttcc 


24426 


gctcgctcac 


ccgcagctgt 


ctgtaccaca 


agagagaaga 


ccagctgcag 


cgcaccctgg 


24486 


acgacgccga 


agcactgttc 


agcaaatact 


gctcagcgtc 


tcttaaagac 


taaaagaccc 


24546 


gcgctttttc 


cccctcggcc 


gccaaaaccc 


acgtcatcgc 


cagcatgagc 


aaggagattc 


24606 


ccacccccta 


catgtggagc 


tatcagcccc 


agatgggcct 


ggccgcgggg 


gccgcccagg 


24666 


actactccag 


caagatgaac 


tggctcagcg 


ccggccccca 


catgatctca 


cgagttaacg 


24726 


gcatccgagc 


ccaccgaaac 


cagattctct 


tagaacaggc 


ggcaatcacc 


gccacacccc 


24786 


ggcgccaact 


caacccgcct 


agttggcccg 


ccgcccaggt 


gtatcaggaa 


aatccccgcc 


24846 


cgaccacagt 


cctcctgcca 


cgcgacgcgg 


aggccgaagt 


cctcatgact 


aactctgggg 


24906 


tacaattagc 


gggcgggtcc 


aggtacgcca 


ggtacagagg 


tcgggccgct 


ccttactctc 


24966 


ccgggagtat 


aaagagggtg 


atcattcgag 


gccgaggtat 


ccagctcaac 


gacgagacgg 


25026 


tgagctcctc 


aaccggtctc 


agacctgacg 


gagtcttcca 


gctcggagga 


gcgggccgct 


25086 


cttccttcac 


cactcgccag 


gcctacctga 


ccctgcagag 


ctcttcctcg 


cagccgcgct 


25146 


ccgggggaat 


cggcactctc 


cagttcgtgg 


aagagttcgt 


tccctccgtc 


tacttcaacc 


25206 


ccttctccgg 


ctcgcctgga 


cgctacccgg 


acgccttcat 


tcccaacttt 


gacgcagtga 


25266 


gtgaatccgt 


ggacggctac 


gactgatgac 


agatggtgcg 


gccgtgagag 


ctcggctgcg 


25326 


acatctgcat 


cactgccgtc 


agcctcgctg 


ctacgctcgg 


gaggcgatcg 


tcttcagcta 


25386 


ctttgagctg 


ccggacgagc 


accctcaggg 


tccggctcac 


gggttgaaac 


tcgagatcga 


25446 
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gaacgcgctc 


gagtctcgcc 


tcatcgacac 


cttcaccgcc 


cgacctctcc 


tggtagaaat 


25506 


ccaacggggg 


atcactacca 


tcaccctgtt 


ctgcatctgc 


cccacgcccg 


gattacatga 


25566 


agatctgtgt 


tgtcatcttt 


gcgctcagtt 


taataaaaac 


tgaacttttt 


gccgcacctt 


25626 


caacgccatc 


tgtgatttct 


acaacaaaaa 


gttcttctgg 


caaaggtaca 


caaactgtat 


25686 


tttattctaa 


ttctacctca 


tctatcgtgc 


tgaactgcgc 


ctgcactaac 


gaacttatcc 


25746 


agtggattgc 


aaacggtagt 


gtgtgcaagt 


acttttgggg 


gaacgatata 


gttagtagaa 


25806 


ataacagcct 


ttgcgagcac 


tgcaactcct 


ccacactaat 


cctttatccc 


ccatttgtta 


25866 


ctggatggta 


tatgtgcgtt 


ggctccggtt 


taaatcctag 


ttgctttcat 


aagtggtttc 


25926 


tacaaaaaga 


gacccttccc 


aacaattctg 


tttctttttt 


cgccctatcc 


tactgctgtt 


25986 


ctccctctgg 


ttactctttc 


aaacctctaa 


ttggtatttt 


agctttgata 


ctcataatct 


26046 


ttattaactt 


tataataatt 


aacaacttac 


agtaaacatg 


cttgttctac 


tgctcgccac 


26106 


atctttcgct 


ctctctcacg 


ccagaacaag 


tattgttggc 


gcaggttaca 


atgcaactct 


26166 


tcaatctgct 


tacatgccag 


attccgacca 


gataccccat 


attacgtggt 


acttacaaac 


26226 


ctccaaacct 


aattcttcat 


tttatgaagg 


aaacaaactc 


tgcgatgact 


ccgacaacag 


26286 


aacgcacaca 


tttccccacc 


cttcactaca 


attcgaatgc 


gtaaacaaaa 


gcttgaagct 


26346 


ttacaactta 


aagccttcag 


attctggctt 


gtaccatgct 


gtagttgaaa 


aaagtaattt 


26406 


agaagtccac 


agtgattaca 


ttgaattgac 


ggttgtggac 


ctgccacctc 


caaaatgtga 


26466 


ggtttcctcc 


tcttaccttg 


aagttcaagg 


cgtggatgcc 


tactgcctca 


tacacattaa 


26526 


ctgcagcaac 


tctaaatatc 


cagctagaat 


ttactataat 


ggacaggaaa 


gtaatctttt 


26586 


ttattattta 


acaacaagcg 


ctggtaacgg 


taaacagtta 


cctgactatt 


ttactgctgt 


26646 


tgttgaattt 


tccacctaca 


gagaaacgta 


tgccaagcgg 


ccttacaatt 


tctcataccc 


26706 


gtttaacgac 


ctttgcaatg 


aaatacaagc 


gctcgaaact 


ggaactgatt 


ttactccaat 


26766 


tttcattgct 


gccattgttg 


taagcttaat 


taccattatt 


gtcagcctag 


cattttactg 


26826 


cttttacaag 


cccaaaaacc 


ctaagtttga 


aaaacttaaa 


ctaaaacctg 


tcattcaaca 


26886 


agtgtgattt 


tgttttccag 


catggtagct 


gcatttctac 


ttctcctctg 


tctacccatc 


26946 


attttcgtct 


cttcaacttt 


cgccgcagtt 


tcccacctgg 


aaccagagtg 


cctaccgcct 


27006 
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tataocctfic 


27066 


ttttttataa 


caatctttca 


agccgccgac 


tatttttacg 


tgcgaattgc 


ttactttaga 


27126 


caccatcctg 


aatacagaaa 


tcaaaacgtt 


gcctccttac 


tttgtttggc 


atgattaagt 


27186 
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tattgctgat 


acttaattat 


ttacccctaa tcaactgtaa 


ttgtccattc 


accaaaccct 


27246 


ggtcattcta 


cacctgttat 


gataaaatcc ccgacactcc 


tgttgcttgg 


ctttacgcag 


27306 


ccaccgccgc 


tttggtattt 


atatctactt gccttggagt 


aaaattgtat 


tttattttac 


27366 


acactgggtg 


gctacatccc 


agagaagatt tacctagata 


tcctcttgta 


aacgcttttc 


27426 


aattacagcc 


tctgcctcct 


cctgatcttc ttcctcgagc 


tccctctatt 


gtgagctact 


27486 


ttcaactcac 


cggtggagat 


gactgactct caggacatta 


atattagtgt 


ggaaagaata 


27546 


gctgctcagc 


gtcagcgaga 


aacgcgagtg ttggaatacc 


tggaactaca 


gcaacttaaa 


27606 


gagtcccact 


ggtgtgagaa 


aggagtgctg tgccatgtta 


agcaggcagc 


cctttcctac 


27666 


gatgtcagcg 


ttcagggaca 


tgaactgtct tacactttgc 


ctttgcagaa 


acaaaccttc 


27726 


tgcaccatga 


tgggctctac 


ctccatcaca atcacccaac 


aagccgggcc 


tgtagagggg 


27786 


gctatcctct 


gtcactgtca 


cgcacctgat tgcatgtcca 


aactaatcaa 


aactctctgt 


27846 


gctttaggtg 


atatttttaa 


ggtgtaaatc aataataaac 


ttaccttaaa 


tttgacaaca 


27906 


aatttctggt 


gacatcattc 


agcagcacca ctttaccctc 


ttcccagctc 


tcgtatggga 


27966 


tgcgatagtg 


ggtggcaaac 


ttcctccaaa ccctaaaaga 


aatattggta 


tccacttcct 


28026 


tgtcctcacc 


cacaattttc 


atcttttcat ag atg aaa aga acc aga gtt gat 
Met Lys Arg Thr Arg Val Asp 


28079 



1435 



1440 



gaa gac ttc aac ccc gtc tac ccc 
Glu Asp Phe Asn Pro Val Tyr Pro 
1445 

gca gtt ccc ttt ata tea ccc ccc 
Ala Val Pro Phe He Ser Pro Pro 
1460 

cag gaa aac ccc cca ggt gtt tta 
Gin Glu Asn Pro Pro Gly Val Leu 
1475 

eta tat ttc gac atg gag aga aaa 
Leu Tyr Phe Asp Met Glu Arg Lys 
1490 



tat gac acc aca acc act cct 28124 
Tyr Asp Thr Thr Thr Thr Pro 
1450 1455 

ttt gta aac age gat ggt ctt 28169 
Phe Val Asn Ser Asp Gly Leu 
1465 1470 

agt ctg cga ata get aaa ccc 28214 
Ser Leu Arg He Ala Lys Pro 
1480 1485 

eta gee ctt tea ctt gga aga 28259 
Leu Ala Leu Ser Leu Gly Arg 
1495 1500 



ggg ttg aca att acc gec gec gga 
Gly Leu Thr He Thr Ala Ala Gly 
1505 



caa tta gaa agt acg cag age 
Gin Leu Glu Ser Thr Gin Ser 
1510 1515 



28304 



gta caa acc aac cca ccg ttg ata 
Val Gin Thr Asn Pro Pro Leu lie 
1520 



att acc aac aac aac aca ctg 
He Thr Asn Asn Asn Thr Leu 
1525 1530 



28349 
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acc eta cgt cat tct ccc ccc tta aac eta act gac aat age tta 28394 

Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 
1535 1540 1545 

gtg eta ggc tac teg agt cct etc cgc gtc aca gac aac aaa ctt 28439 

Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu 
1550 1555 1560 

aca ttt aac ttc aca tea cca etc cgt tat gaa aat gaa aac ctt 28484 

Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu 
1565 1570 1575 

act ttt aac tat aca gag cct ctt aaa ctt ata aat aac age ctt 28529 

Thr Phe Asn Tyr Thr Glu Pro Leu Lys Leu lie Asn Asn Ser Leu 
1580 1585 1590 

gec att gac ate aat tec tea aaa ggc ctt agt age gtc gga ggc 28574 

Ala lie Asp lie Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly 
1595 1600 1605 

tea eta get gta aac ctg agt tea gac tta aag ttt gac age aac 28619 

Ser Leu Ala Val Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn 
1610 1615 1620 



gga tec ata get ttt ggc ata caa acc ctg tgg acc get ccg acc 
Gly Ser lie Ala Phe Gly lie Gin Thr Leu Trp Thr Ala Pro Thr 
1625 1630 1635 



28664 



teg act ggc aac tgc acc gtc tac age 
Ser Thr Gly Asn Cys Thr Val Tyr Ser 
1640 1645 



gag ggc gat tec eta ctt 
Glu Gly Asp Ser Leu Leu 
1650 



28709 



agt etc tgt tta acc aaa tgc gga get cac gtc tta gga agt gta 
Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly Ser Val 
1655 1660 1665 



28754 



agt tta acc ggt tta aca gga acc ata acc caa atg act gat att 
Ser Leu Thr Gly Leu Thr Gly Thr lie Thr Gin Met Thr Asp lie 
1670 1675 1680 

tct gtc acc att caa ttt. aca ttt gac aac aat ggt aag eta eta 
Ser Val Thr He Gin Phe Thr Phe Asp Asn Asn Gly Lys Leu Leu 
1685 1690 1695 

age tct cca ctt ata aac aac gec ttt agt att cga cag aat gac 
Ser Ser Pro Leu He Asn Asn Ala Phe Ser He Arg Gin Asn Asp 
1700 1705 1710 

agt acg gec tea aac cct acc tac aac gec ctg gcg ttt atg cct 
Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu Ala Phe Met Pro 
1715 1720 1725 



28799 



28844 



28889 



28934 



aac agt acc ata tat gca aga ggg gga 
Asn Ser Thr He Tyr Ala Arg Gly Gly 
1730 1735 



ggt ggt gaa cca cga aac 
Gly Gly Glu Pro Arg Asn 
1740 



28979 
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aac tac tac gtc 
Asn Tyr Tyr Val 
1745 



caa acg tat ctt agg gga aat gtt caa aaa cca 
Gin Thr Tyr Leu Arg Gly Asn Val Gin Lys Pro 
1750 1755 



29024 



ate att ctt act gta acc tac aac tea gtc gec aca gga tat tec 
lie lie Leu Thr Val thr Tyr Asn Ser Val Ala Thr Gly Tyr Ser 
1760 1765 1770 



29069 



tta tct ttt aag 
Leu Ser Phe Lys 
1775 



tgg act get ctt gca cgt gaa aag ttt gca acc 
Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 
1780 1785 



29114 



cca aca acc teg 
Pro Thr Thr Ser 
1790 



ttt tgc tac att aca gaa caa taa aaccgtgtac 
Phe Cys Tyr lie Thr Glu Gin 
1795 



29160 



cccaccgttt cgtttttttc ag atg aaa egg gcg aga gtt gat gaa gac 

Met Lys Arg Ala Arg Val Asp Glu Asp 
1800 1805 



29209 



ttc aac cca gtg tac cct tat gac ccc cca cat get cct gtt atg 
Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro His Ala Pro Val Met 
1810 1815 1820 



29254 



ccc ttc att act 
Pro Phe He Thr 
1825 



cca cct ttt acc tec teg gat ggg ttg cag gaa 
Pro Pro Phe Thr Ser Ser Asp Gly Leu Gin Glu 
1830 ^ 1835 



29299 



aaa cca ctt gga gtg tta agt tta aac tac aga gat ccc att act 
Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp Pro He Thr 
1840 1845 1850 



29344 



acg caa aat gag tct ctt aca att aaa eta gga aac ggc etc act 
Thr Gin Asn Glu Ser Leu Thr He Lys Leu Gly Asn Gly Leu Thr 
1855 1860 " 1865 



29389 



eta gac aac cag gga caa eta aca tea acc get ggc gaa gta gaa 
Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala Gly Glu Val Glu 
1870 1875 1880 



29434 



cct cca etc act aac get aac aac aaa ctt gca ctg gtc tat age 
Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu Val Tyr Ser 
1885 1890 1895 



29479 



gat cct tta gca gta aag cgc aac age eta acc tta teg cac acc 
Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser His Thr 
1900 1905 1910 



29524 



get ccc ctt gtt att get gat aac tct tta gca ttg caa gtt tea 
Ala Pro Leu Val He Ala Asp Asn Ser Leu Ala Leu Gin Val Ser 
1915 1920 1925 



29569 



gag cct att ttt ata aat gac aag gac aaa eta gee ctg caa aca 
Glu Pro He Phe lie Asn Asp Lys Asp Lys Leu Ala Leu Gin Thr 
1930 1935 1940 



29614 
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gcc gcg ccc ctt gta act aac get ggc acc ctt cgc tta caa age 29659 
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser 
1945 1950 1955 

gcc gcc cct tta ggc att gca gac caa acc eta aaa etc ctg ttt 29704 
Ala Ala Pro Leu Gly He Ala Asp Gin Thr Leu Lys Leu Leu Phe 
I960 1965 1970 

acc aac cct ttg tac ttg cag aat aac ttt etc acg tta gcc att 29749 
Thr Asn Pro Leu Tyr Leu Gin Asn Asn Phe Leu Thr Leu Ala He 
1975 1980 1985 

gaa cga ccc ctt gcc att acc aat act gga aag ctg get eta cag 29794 
Glu Arg Pro Leu Ala He Thr Asn Thr Gly Lys Leu Ala Leu Gin 
1990 1995 2000 

etc tec cca ccg eta caa aca gca gac aca ggc ttg act ttg caa 29839 
Leu Ser Pro Pro Leu Gin Thr Ala Asp Thr Gly Leu Thr Leu Gin 
2005 2010 2015 

acc aac gtg cca tta act gta age aac ggg acc eta ggc tta gcc 29884 
Thr Asn Val Pro Leu Thr Val Ser Asn Gly Thr Leu Gly Leu Ala 
2020 2025 2030 

ata aag cgc cca ctt att att cag gac aac aac ttg ttt ttg gac 29929 
He Lys Arg Pro Leu He He Gin Asp Asn Asn Leu Phe Leu Asp 
2035 2040 2045 

ttc aga get ccc ctg cgt ctt ttc aac age gac cca gta eta ggg 29974 
Phe Arg Ala Pro Leu Arg Leu Phe Asn Ser Asp Pro Val Leu Gly 
2050 2055 2060 

ctt aac ttt tac acc cct ctt gcg gta cgc gat gag gcg etc act 30019 
Leu Asn Phe Tyr Thr Pro Leu Ala Val Arg Asp Glu Ala Leu Thr 
2065 2070 2075 

gtt aac aca ggc cgc ggc etc aca gtg agt tac gat ggt tta att 30064 
Val Asn Thr Gly Arg Gly Leu Thr Val Ser Tyr Asp Gly Leu He 
2080 2085 2090 

tta aat ctt ggt aag gat ctt cgc ttt gac aac aac acc gtt tct 30109 
Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp Asn Asn Thr Val Ser 
2095 2100 2105 

gtc get ctt agt get get ttg cct tta caa tac act gat cag ctt 30154 
Val Ala Leu Ser Ala Ala Leu Pro Leu Gin Tyr Thr Asp Gin Leu 
2110 2115 2120 

cgc ctt aac gtg ggc get ggg ctg cgt tac aat cca gtg agt aag 30199 
Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro Val Ser Lys 
2125 2130 2135 

aaa ttg gac gtg aac ccc aat caa aac aag ggt tta acc tgg gaa 30244 
Lys Leu Asp Val Asn Pro Asn Gin Asn Lys Gly Leu Thr Trp Glu 
2140 2145 2150 
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aat gac tac etc att gta aag eta gga aat gga tta ggt ttt gat 30289 

Asn Asp Tyr Leu lie Val Lys Leu Gly Asn Gly Leu Gly Phe Asp 
2155 2160 2165 

ggc gat gga aac ata get gtt tct cct caa gtt aca teg cct gac 30334 

Gly Asp Gly Asn lie Ala Val Ser Pro Gin Val Thr Ser Pro Asp 
2170 2175 2180 



acc tta tgg acc act gee gac cca tec ccc aat tgt tec ate tac 
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser He Tyr 
2185 2190 2195 

act gat tta gat gec aaa atg tgg etc teg ttg gta aaa caa ggg 
Thr Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly 
2200 2205 2210 

ggt gtg gtt cac ggt tct gtt get tta aaa gca ttg aaa gga acc 
Gly Val Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr 
2215 2220 2225 

eta ttg agt cct acg gaa age gec att gtt att ata eta cat ttt 
Leu Leu Ser Pro Thr Glu Ser Ala He Val He He Leu His Phe 
2230 2235 2240 

gac aat tat gga gtg cga att etc aat tat ccc act ttg ggc act 
Asp Asn Tyr Gly Val Arg He Leu Asn Tyr Pro Thr Leu Gly Thr 
2245 2250 2255 

caa ggc acg ttg gga aat aat gca act tgg ggt tat agg cag gga 
Gin Gly Thr Leu Gly Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly 
2260 2265 2270 

gaa tct gca gac act aat gta etc aat gca eta gca ttt atg ccc 
Glu Ser Ala Asp Thr Asn Val Leu Asn Ala Leu Ala Phe Met Pro 
2275 2280 2285 

agt tea aaa agg tac cca aga ggg cgt gga age gaa gtt cag aat 
Ser Ser Lys Arg Tyr Pro Arg Gly Arg Gly Ser Glu Val Gin Asn 
2290 ^ 2295 2300 

caa act gtg ggc tac act tgt ata cag ggt gac ttt tct atg ccc 
Gin Thr Val Gly Tyr Thr Cys He Gin Gly Asp Phe Ser Met Pro 
2305 2310 2315 

gta ccg tac caa ata cag tac aac tat gga cca act ggc tac tec 
Val Pro Tyr Gin He Gin Tyr Asn Tyr Gly Pro Thr Gly Tyr Ser 
2320 2325 2330 

ttt aaa ttt att tgg aga act gtt tea aga caa cca ttt gac ate 
Phe Lys Phe He Trp Arg Thr Val Ser Arg Gin Pro Phe Asp He 
2335 2340 2345 

cca tgc tgt ttt ttc tct tac att acg gaa gaa taa aacaactttt 
Pro Cys Cys Phe Phe Ser Tyr He Thr Glu Glu 
2350 2355 



30379 



30424 



30469 



30514 



30559 



30604 



30649 



30694 



30739 



30784 



30829 



30875 



tctttttatt ttctttttat tttacacgea cagtaaggct tcctccaccc ttccatctca 30935 
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cagcatacac 


cagcctctcc 


cccttcatgg 


cagtaaactg 


ttgtgagtca 


gtccggtatt 


30995 


tgggagttaa 


gatccaaaca 


gtctctttgg 


tgatgaaaca 


tggatccgtg 


atggacacaa 


31055 


atccctggga 


caggttctcc 


aacgtttcgg 


taaaaaactg 


catgccgccc 


tacaaaacaa 


31115 


acaggttcag 


gctctccacg 


ggttatctcc 


ccgatcaaac 


tcagacagag 


taaaggtgcg 


31175 


atgatgttcc 


actaaaccac 


gcaggtggcg 


ctgtctgaac 


ctctcggtgc 


gactcctgtg 


31235 


aggctggtaa 


gaagttagat 


tgtccagcag 


cctcacagca 


tggatcatca 


gtctacgagt 


31295 


gcgtctggcg 


cagcagcgca 


tctgaatctc 


actgagattc 


cggcaagaat 


cgcacaccat 


31355 


cacaatcagg 


ttgttcatga 


tcccatagct 


gaacacgctc 


cagccaaagc 


tcattcgctc 


31415 


caacagcgcc 


accgcgtgtc 


cgtccaacct 


tactttaaca 


taaatcaggt 


gtctgccgcg 


31475 


tacaaacatg 


ctacccgcat 


acagaacctc 


ccggggcaaa 


cccctgttca 


ccacctgcct 


31535 


gtaccaggga 


aacctcacat 


ttatcaggga 


gccatagata 


gccattttaa 


accaattagc 


31595 


taacaccgcc 


ccaccagctc 


tacactgaag 


agaaccggga 


gagttacaat 


gacagtgaat 


31655 


aatccatctc 


tcataacccc 


taatggtctg 


atggaaatcc 


agatctaacg 


tggcacagca 


31715 


gatacacact 


ttcatataca 


ttttcatcac 


atgtttttcc 


caggccgtta 


aaatacaatc 


31775 


ccaatacacg 


ggccactcct 


gcagtacaat 


aaagctaata 


caagatggta 


tactcctcac 


31835 


ctcactaaca 


ttgtgcatgt 


tcatattttc 


acattctaag 


taccgagagt 


tctcctctac 


31895 


aacagcactg 


ccgcggtcct 


cacaaggtgg 


tagctggtga 


cgattgtaag 


gagccagtct 


31955 


gcagcgatac 


cgtctgtcgc 


gttgcatcgt 


agaccaggga 


ccgacgcact 


tcctcgtact 


32015 


tgtagtagca 


gaaccacgtc 


cgctgccagc 


acgtctccaa 


gtaacgccgg 


tccctgcgtc 


32075 


gctcacgctc 


cctcctcaac 


gcaaagtgca 


accactcttg 


taatccacac 


agatccctct 


32135 


cggcctccgg 


ggcgatgcac 


acctcaaacc 


tacagatgtc 


tcggtacagt 


tccaaacacg 


32195 


tagtgagggc 


gagttccaac 


caagacagac 


agcctgatct 


atcccgacac 


actggaggtg 


32255 


gaggaagaca 


cggaagaggc 


atgttattcc 


aagcgattca 


ccaacgggtc 


gaaatgaaga 


32315 


tcccgaagat 


gacaacggtc 


gcctccggag 


ccctgatgga 


atttaacagc 


cagatcaaac 


32375 


attatgcgat 


tttccaggct 


atcaatcgcg 


gcctccaaaa 


gagcctggac 


ccgcacttcc 


32435 


acaaacacca 


gcaaagcaaa 


agcgttatta 


tcaaactctt 


cgatcatcaa 


gctgcaggac 


32495 


tgtacaatgc 


ccaagtaatt 


ttCatttctc 


cactcgcgaa 


tgatgtcgcg 


gcaaatagtc 


32555 


tgaaggttca 


tgccgtgcat 


attaaaaagc 


tccgaaaggg 


cgccctctat 


agccatgcgt 


32615 



140 



WO 03/046124 PCT/US02/33645 

t 



agacacacca 


tcatgactgc 


aagatatcgg 


gctcctgaga cacctgcagc agatttaaca 


32675 


g^cccaggtc 


aggttgctct 


ccgcgatcgc 


gaatctccat ccgcaaagtc atttgcaaat 


32735 


aattaaatag 


atctgcgccg 


actaaatctg 


ttaactccgc gctaggaact aaatcaggtg 


32795 


tggctacgca 


gcacaaaagt 


tccagggatg 


gcgccaaact cactagaacc gctcccgagt 


32855 


agcaaaactg 


atgaatggga 


gtaacacagt 


gtaaaatgtt cagccaaaaa tcactaagct 


32915 


gctcctttaa 


aaagtccagt 


acttctatat 


tcagttcgtg caagtactga agcaactgtg 


32975 


cgggaatatg 


cacagcaaaa 


aaaatagggc 


ggctcagata catgttgacc taaaataaaa 


33035 


agaatcatta 


aactaaagaa 


gcctggcgaa 


cggtgggata tatgacacgc tccagcagca 


33095 


ggcaagcaac 


cggctgtccc 


cgggaaccgc 


ggtaaaattc atccgaatga ttaaaaagaa 


33155 


caacagagac 


ttcccaccat 


gtactcggtt 


ggatctcctg agcacagagc aatacccccc 


33215 


tcacattcat 


atccgctaca 


gaaaaaaaac 


gtcccagata cccagcggga atatccaacg 


33275 


acagctgcaa 


agacagcaaa 


acaatccctc 


tgggagcaat cacaaaatcc tccggtgaaa 


33335 


aaagcacata 


catattagaa 


taaccctgtt 


gctggggcaa aaaggcccgt cgtcccagca 


33395 


aatgcacata 


aatatgttca 


tcagccattg 


ccccgtctta ccgcgtaaac agccacgaaa 


33455 


aaatcgagct 


aaaatccacc 


caacagccta 


tagctatata tacactccac ccaatgacgc 


33515 


taataccgca 


ccacccacga 


ccaaagttca 


cccacaccca caaaacccgc gaaaatccag 


33575 


cgccgtcagc 


acttccgcaa 


tttcagtctc 


acaacgtcac ttccgcgcgc cttttcactt 


33635 


tcccacacac 


gcccttcgcc 


cgcccgccct 


cgcgccaccc cgcgtcaccc cacgtcaccg 


33695 


cacgtcaccc 


cggccccgcc 


tcgctcctcc 


ccgctcatta tcatattggc acgtttccag 


33755 


aataaggtat 


attattgatg 


cagcaaaaca 


atccctctgg gagcaatcac aaaatcctcc 


33815 


ggtgaaaaaa 


gcacatacat 


attagaataa 


ccctgttgct ggggcaaaaa ggcccgtcgt 


33875 


cccagcaaat 


gcacataaat 


atgttcatca 


gccattgccc cgtcttaccg cgtaaacagc 


33935 


cacgaaaaaa 


tcgagctaaa 


atccacccaa 


cagcctatag ctatatatac actccaccca 


33995 


atgacgctaa 


taccgcacca 


cccacgacca 


aagttcaccc acacccacaa aacccgcgaa 


34055 


aatccagcgc 


cgtcagcact 


tccgcaattt 


cagtctcaca acgtcacttc cgcgcgcctt 


34115 


J- A —1.1.1. 

txcactttcc 


cacacacgcc 


cttcgcccgc 


ccgccctcgc gccaccccgc gtcaccccac 


34179 


gtcaccgcac 


gtcaccccgg 


ccccgcctcg 


ctcctccccg ctcattatca tattggcacg 


34235 


tttccagaat 


aaggtatatt 


attgatgca 




34264 
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<210> 25 

<211> 503 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 25 

Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 
1 ~ 5 10 15 

Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 
20 25 30 

Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 
35 • 40 45 

Asn Ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 
50 55 60 

Val Tyr Leu Val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr 
65 70 75 80 

Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp 
85 90 95 

Phe Thr Pro Thr Glu Ala Gly Thr Gin Thr lie Asn Phe Asp Glu Arg 
100 105 110 

Ser Arg Trp Gly Gly Gin Leu Lys Thr lie Leu His Thr Asn Met Pro 
115 ~ 120 125 

Asn lie Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu Met 
130 135 140 

Val Lys Lys Ala Glu Asn Gin Pro Pro Glu Tyr Glu Trp Phe Glu Phe 
145 ^ 150 155 160 

Thr lie Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr lie Asp Leu Met 
165 170 175 

Asn Asn Ala He Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly 
180 185 190 

Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn Phe Arg 
195 200 205 

Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr 
210 " 215 220 

Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys Gly Val 
225 230 235 240 

Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly He Arg Lys Arg 
245 250 255 

Arg Pro Phe Gin Glu Gly Phe Gin He Met Tyr Glu Asp Leu Glu Gly 
260 265 270 
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Gly Asn lie Pro Gly Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser Val 
275 280 285 

Lys Gin Ala Glu Ala Gin Gly Arg Glu He Arg Gly Asp Thr Phe Ala 
290 295 300 

Thr Glu Pro His Glu Leu Val He Lys Pro Leu Glu Gin Asp Ser Lys 
305 310 315 320 

Lys Arg Ser Tyr Asn He He Ser Gly Thr Met Asn Thr Leu Tyr Arg 
325 330 335 

Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ser Trp Thr He Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg 
370 375 380 

Pro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu 
385 390 395 400 

Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 
405 410 415 

Leu He Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val 
435 440 445 

Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 
450 455 460 

Ser Ser He Ser Gly Val Gin Arg Val Thr He Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys 
485 490 495 

Val Leu Ser Ser Arg Thr Phe 
500 



<210> 26 
<211> 931 
<212> PRT 

<213> simian adenovirus SV-1 
<400> 26 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala 
15 10 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Axg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp lie Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys Val 
130 135 140 

Arg Ala Gin Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala He Thr 
145 150 155 160 

Lys Glu Gly He Gin Val Gly Val Thr Leu Thr Asp Ser Gly Ser Thr 
165 170 175 

Pro Gin Tyr Ala Asp Lys Thr Tyr Gin Pro Glu Pro Gin He Gly Glu 
180 185 190 

Leu Gin Trp Asn Ser Asp Val Gly Thr Asp Asp Lys He Ala Gly Arg 
195 200 205 

Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala 
210 215 220 

Arg Pro Thr Asn Glu Lys Gly Gly Gin Ala Thr Pro Ser Ala Ser Gin 
225 230 235 240 

Asp Val Gin Asn Pro Glu Leu Gin Phe Phe Ala Ser Thr Asn Val Ala 
245 250 255 

Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser He Glu Ma 
260 . 265 270 

Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly He Thr 
275 280 285 

Ser Ser Glu Ala Leu Leu Thr Gin Gin Ma Ala Pro Asn Arg Pro Asn 
290 295 300 

Tyr He Ala Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser 
305 310 315 320 
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Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 
325 330 335 

Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 
340 ^ 345 350 

Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin 
355 ~ 360 365 

Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He Glu Asn His Gly 
370 375 380 

Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Ala 
385 * 390 395 400 

Val Thr Asp Thr Tyr Ser Pro He Lys Val Asn Gly Gly Gly Asn Gly 
405 410 415 

Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu He Gly 
420 425 430 

Ser Gly Asn Met Phe Ala Met Glu He Asn Leu Gin Ala Asn Leu Trp 
435 440 445 

Arg Ser Phe Leu Tyr Ser Asn He Gly Leu Tyr Leu Pro Asp Ser Leu 
450 455 460 

Lys He Thr Pro Asp Asn He Thr Leu Pro Glu Asn Lys Asn Thr Tyr 
465 470 475 480 

Gin Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr Tyr 
485 490 495 

Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser He Asn 
500 505 510 

Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 
515 520 525 

Leu Gly Asn Gly Arg Tyr Val Pro Phe His He Gin Val Pro Gin Lys 
530 * 535 540 

Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr 
545 550 555 560 

Glu Trp Asn Phe Arg Lys Asp Val Asn Met He Leu Gin Ser Ser Leu 
565 570 575 

Gly Asn Asp Leu Arg Val Asp Gly Ala Ser He Arg Phe Asp Ser He 
580 " 585 590 

Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr 
595 600 605 

Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe Asn Asp 
610 615 620 
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Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro He Pro Ala Asn Ala Thr 
625 630 635 640 

Ser Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly 
645 650 655 

Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser 
660 ' 665 670 

Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser He Pro Tyr Leu Asp 
675 680 685 

Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser He Met Phe 
690 695 700 

Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 
705 710 715 720 

Glu Phe Glu He Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala 
725 730 735 

Gin Ser Asn Met Thr Lys Asp Trp Phe Leu He Gin Met Leu Ser His 
740 745 750 

Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp 
755 760 765 

Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg Gin He 
770 775 780 

Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gin Asp Val Lys Leu Pro Tyr 
785 790 795 800 

Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr Met Arg 
805 810 815 

Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He Gly Ala 
820 825 830 

Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu Cys Asp Arg Val 
835 840 845 

Met Trp Arg He Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ser Leu 
850 " 855 860 

Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu 
865 870 875 880 

Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr 
885 890 895 

Val Leu Phe Glu Val Phe Asp Val Val Arg He His Gin Pro His Arg 
900 905 910 
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Gly Val He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
915 920 925 

Ala Thr Thr 
930 



<210> 27 

<211> 363 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 27 

Met Lys Arg Thr Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
1 " 5 10 15 

Asp Thr Thr Thr Thr Pro Ala Val Pro Phe He Ser Pro Pro Phe Val 
20 25 30 

Asn Ser Asp Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Arg 
35 40 45 

He Ala Lys Pro Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser 
50 55 60 

Leu Gly Arg Gly Leu Thr He Thr Ala Ala Gly Gin Leu Glu Ser Thr 
65 70 75 80 

Gin Ser Val Gin Thr Asn Pro Pro Leu He He Thr Asn Asn Asn Thr 
85 90 95 

Leu Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 
100 105 110 

Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu Thr 
115 120 125 

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu Thr Phe 
130 135 140 

Asn Tyr Thr Glu Pro Leu Lys Leu He Asn Asn Ser Leu Ala He Asp 
145 150 155 160 

He Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly Ser Leu Ala Val 
165 170 175 

Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn Gly Ser He Ala Phe 
180 185 190 

Gly He Gin Thr Leu Trp Thr Ala Pro Thr Ser Thr Gly Asn Cys Thr 
195 200 205 

Val Tyr Ser Glu Gly Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 
210 215 220 
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Gly Ala His Val Leu Gly Ser Val Ser Leu Thr Gly Leu Thr Gly Thr 
225 230 235 240 

He Thr Gin Met Thr Asp He Ser Val Thr He Gin Phe Thr Phe Asp 
245 250 255 

Asn Asn Gly Lys Leu Leu Ser Ser Pro Leu He Asn Asn Ala Phe Ser 
260 265 270 

He Arg Gin Asn Asp Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu 
275 280 285 

Ala Phe Met Pro Asn Ser Thr He Tyr Ala Arg Gly Gly Gly Gly Glu 
290 295 300 

Pro Arg Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Gin 
305 310 315 320 

Lys Pro He He Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr 
325 330 335 

Ser Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 
340 345 350 

Pro Thr Thr Ser Phe Cys Tyr He Thr Glu Gin 
355 360 



<210> 28 
<211> 560 
<212> PRT 

<213> simian adenovirus SV~1 
<400> 28 

Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
1^5 10 15 

Asp Pro Pro His Ala Pro Val Met Pro Phe He Thr Pro Pro Phe Thr 
20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 
35 " 40 45 

Tyr Arg Asp Pro lie Thr Thr Gin Asn Glu Ser Leu Thr He Lys Leu 
50 55 60 

Gly Asn Gly Leu Thr Leu Asp Asn Gin Gly Gin Leu Thr Ser Thr Ala 
65 70 75 80 

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
85 90 95 

Val Tyr Ser Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser 
100 105 HO 
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His Thr Ala Pro Leu Val He Ala Asp Asn Ser Leu Ma Leu Gin Val 
115 120 125 

Ser Glu Pro He Phe He Asn Asp Lys Asp Lys Leu Ala Leu Gin Thr 
130 135 140 

Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 
145 150 155 160 

Ala Pro Leu Gly He Ala Asp Gin Thr Leu Lys Leu Leu Phe Thr Asn 
165 170 175 

Pro Leu Tyr Leu Gin Asn Asn Phe Leu Thr Leu Ala He Glu Arg Pro 
180 185 190 

Leu Ala He Thr Asn Thr Gly Lys Leu Ala Leu Gin Leu Ser Pro Pro 
195 200 205 

Leu Gin Thr Ala Asp Thr Gly Leu Thr Leu Gin Thr Asn Val Pro Leu 
210 215 220 

Thr Val Ser Asn Gly Thr Leu Gly Leu Ala lie Lys Arg Pro Leu He 
225 230 235 240 

He Gin Asp Asn Asn Leu Phe Leu Asp Phe Arg Ala Pro Leu Arg Leu 
245 250 255 

Phe Asn Ser Asp Pro Val Leu Gly Leu Asn Phe Tyr Thr Pro Leu Ala 
260 265 270 

Val Arg Asp Glu Ala Leu Thr Val Asn Thr Gly Arg Gly Leu Thr Val 
275 280 285 

Ser Tyr Asp Gly Leu He Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp 
290 295 300 

Asn Asn Thr Val Ser Val Ala Leu Ser Ala Ala Leu Pro Leu Gin Tyr 
305 310 " 315 320 

Thr Asp Gin Leu Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro 
325 330 335 

Val Ser Lys Lys Leu Asp Val Asn Pro Asn Gin Asn Lys Gly Leu Thr 
340 345 350 

Trp Glu Asn Asp Tyr Leu He Val Lys Leu Gly Asn Gly Leu Gly Phe 
355 * 360 365 

Asp Gly Asp Gly Asn He Ala Val Ser Pro Gin Val Thr Ser Pro Asp 
370 375 380 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser He Tyr Thr 
385 390 395 400 

Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gin Gly Gly Val 
405 410 415 
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Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Ser 
420 425 430 

Pro Thr Glu Ser Ala lie Val He He Leu His Phe Asp Asn Tyr Gly 
435 440 445 

Val Arg He Leu Asn Tyr Pro Thr Leu Gly Thr Gin Gly Thr Leu Gly 
450 455 460 

Asn Asn Ala Thr Trp Gly Tyr Arg Gin Gly Glu Ser Ala Asp Thr Asn 
465 470 475 480 

Val Leu Asn Ala Leu Ala Phe Met Pro Ser Ser Lys Arg Tyr Pro Arg 
485 490 495 

Gly Arg Gly Ser Glu Val Gin Asn Gin Thr Val Gly Tyr Thr Cys He 
500 505 510 

Gin Gly Asp Phe Ser Met Pro Val Pro Tyr Gin He Gin Tyr Asn Tyr 
515 520 525 

Gly Pro Thr Gly Tyr Ser Phe Lys Phe He Trp Arg Thr Val Ser Arg 
530 " 535 540 

Gin Pro Phe Asp He Pro Cys Cys Phe Phe Ser Tyr He Thr Glu Glu 
545 ' 550 555 560 



<210> 29 

<211> 31044 

<212> DNA 

<213> simian adenovirus SV-25 

<220> 

<221> CDS 

<222> (12284) (13801) 

<223> Penton 

<220> 

<221> CDS 

<222> (16681) .. (19446) 

<223> Hexon 

<220> 

<221> CDS 

<222> (25380) (26423) 

<223> Fiber #2 

<220> 

<221> CDS 

<222> (26457) .. (28136) 
<223> Fiber #1 

<400> 29 

catcatcaat aatatacctt attctggaaa 



cgtgccaata tgataatgag cggggaggag 
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cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120 

gcgggtgtgt gtgtgggagg cgcttagttt ttacgtatgc ggaaggaggt tttataccgg 180 

aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240 

aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300 

cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360 

aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420 

atcaagaggc cactcttgag tgctggcgag aagagttttc tcctccgtgc tgccacgatg 480 

aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540 

ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600 

cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660 

gcaggagacg cagtttgcta ggctgttggc cgatactcct ggagtttttg tggctctgga 720 

tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780 
tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840 
cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900 
catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960 

ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020 

ggaggactag gtgctgagga tgatcccgaa gaggggacta gtggggctag gaagaagcaa 1080 

aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140 

cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaactg 1200 

aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260 

gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320 

actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380 

gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440 

ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500 

aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560 

tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620 

ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgcgtg 1680 

tttgagcggt gtgtgttggc tttaactgtg gagggctgtg gacgcattag gaataatgcg 1740 
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gcgtctgaga atggatgttt tcttttgcta aaaggcacgg ctagtattaa gcataacatg 1800 

atatgcggca gcggtctgta cccttcacag ctgttaactt gcgcggatgg aaactgtcag 1860 

accttgcgca ccgtgcacat agcgtcccac cagcgccgcg cctggccaac attcgagcac 1920 

aatatgctta tgcgttgtgc cgtccacttg ggccctaggc gaggcgtgtt tgtgccttac 1980 

cagtgtaact ttagccatac caagatttta ctagaacctg ataccttctc tcgagtgtgt 2040 

ttcaatgggg tgtttgacat gtcaatggaa ctgtttaaag tgataagata tgatgaatcc 2100 

aagtctcgtt gtcgcccatg tgaatgcgga gctaatcatc tgaggttgta tcctgtaacc 2160 

ctaaacgtta ccgaggagct gaggacggat caccacatgt tgtcctgcct gcgcaccgac 2220 

tatgaatcca gcgacgagga gtgaggtgag gggcggagcc acaaagggta taaaggggcg 2280 

tgaggggtgg gtgtgatgat tcaaaatgag cgggacgacg gacggcaacg cgtttgaggg 2340 

tggagtgttc agcccttatc tgacatctcg tcttccttcc tgggcaggag tgcgtcagaa 2400 

tgtagtgggc tccaccgtgg acggacgacc ggtcgcccct gcaaattccg ccaccctcac 2460 

ctatgccacc gtgggatcat cgttggacac tgccgcggca gctgccgctt ctgctgccgc 2520 

ttctactgct cgcggcatgg cggctgattt tggactgtat aaccaactgg ccactgcagc 2580 

tgtggcgtct cggtctctgg ttcaagaaga tgccctgaat gtgatcctga ctcgcctgga 2640 
gatcatgtca cgtcgcttgg acgaactggc tgcgcagata tcccaagcta accccgatac 
cacttcagaa tcctaaaata aagacaaaca aatatgttga aaagtaaaat ggctttattt 

gttttttttg gctcggtagg ctcgggtcca cctgtctcgg tcgttaagaa ctttgtgtat 2820 

gttttccaaa acacggtaca gatgggcttg gatgttcaag tacatgggca tgaggccatc 2880 

tttggggtga agataggacc attgaagagc gtcatgctcc ggggtggtgt tgtaaattac 2940 

ccagtcgtag cagggtttct gggcgtggaa ctggaagatg tcctttagga gtaggctgat 3000 

ggccaagggc aggcccttag tgtaggtgtt tacaaagcgg ttaagctggg agggatgcat 3060 

gcggggggag atgatatgca tcttggcttg gatcttgagg ttagctatgt taccacccag 3120 

gtctctgcgg gggttcatgt tatgaaggac caccagcacg gtgtagccgg tgcatttggg 3180 

gaacttgtca tgcagtttgg aggggaaggc gtggaagaat ttagagaccc ccttgtggcc 3240 

ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300 

ggcaaacacg ttttgggggt tggaaacatc atagttttgc tctagagtga gctcatcata 3360 

ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420 

gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 3480 



2700 
2760 
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tatcatgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 3540 

ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600 

cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660 

gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720 

gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780 

taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840 

gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900 

ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960 

ttccagggtc tcagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020 

ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080 

tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140 

gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200 

cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 4260 

ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320 

tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 4380 

ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 4440 

cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 

aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560 

atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620 

aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680 

ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740 

tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800 

acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860 

tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920 

agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980 

ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040 

aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100 

atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160 
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cccccttttc tggagcagaa cggaggcagg ggatctagca agttggcggg cggggggtcg 5220 

gcgtcgatgg tgaagatacc gggtagcagg atcttattaa aataatcgat ttcggtgtcc 5280 

gtgtcttgca acgcgtcttc ccacttcttc accgccaggg ccctttcgta gggattcagg 5340 

ggcggtcccc agggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 5400 

acgtacaggg gttccctcaa caccccgatg taagtggggt aacagcgccc cccgcggatg 5460 

ctggctcgca cgtagtcgta catctcgcgc gagggagcca tgaggccgtc tcccaagtgg 5520 

gtcttgtggg gtttttcggc ccggtagagg atctgtctga agatggcgtg ggagttggaa 5580 

gagatggtgg ggcgttggaa gacgttaaag ttggccccgg gtagtcccac ggagtcttgg 5640 

atgaactggg cgtaggattc ccggagtttg tccaccaggg cggcggtcac cagcacgtcg 5700 

agagcgcagt agtccaacgt ctcgcggacc aggttgtagg ccgtctcttg ttttttctcc 5760 

cacagttcgc ggttgaggag gtattcctcg cggtctttcc agtactcttc ggcgggaaat 5820 

cctttttcgt ccgctcggta agaacctaac atgtaaaatt cgttcaccgc tttgtatgga 5880 

caacagcctt tttctaccgg cagggcgtac gcttgagcgg cctttctgag agaggtgtgg 5940 

gtgagggcga aggtgtcccg caccatcact ttcaggtact gatgtttgaa gtccgtgtcg 6000 

tcgcaggcgc cctgttccca cagcgtgaag tcggtgcgct ttttctgcct gggattgggg 6060 

agggcgaagg tgacatcgtt aaagagtatt ttcccggcgc ggggcatgaa gttgcgagag 6120 

atcctgaagg gcccgggcac gtccgagcgg ttgttgatga cctgcgccgc caggacgatc 6180 

tcgtcgaagc cgttgatgtt gtgacccacg atgtaaagtt cgatgaagcg cggctgtccc 6240 

ttgagggccg gcgctttttt caactcctcg taggtgagac agtccggcga ggagagaccc 6300 

agctcagccc gggcccagtc ggagagttga ggattagccg caaggaagga gctccataga 6360 

tccaaggcca ggagagtttg caagcggtcg cggaactcgc ggaacttttt ccccacggcc 6420 

attttctccg gtgtcactac gtaaaaggtg ttggggcggt tgttccacac gtcccatcgg 6480 

agctctaggg ccagctcgca ggcttggcga acgagggtct cctcgccaga gacgtgcatg 6540 

accagcataa agggtaccaa ctgtttcccg aacgagccca tccatgtgta ggtttctacg 6600 

tcgtaggtga caaagagccg ctgggtgcgc gcgtgggagc cgatcggaaa gaagctgatc 6660 

tcctgccacc agctggagga atgggtgtta atgtggtgga agtagaagtc ccgccggcgc 6720 

acagagcatt cgtgctgatg tttgtaaaag cgaccgcagt agtcgcagcg ctgcacgctc 6780 

tgtatctcct gaacgagatg cgcttttcgc ccgcgcacca gaaaccggag ggggaagttg 6840 

agacgggggg ctggtggggc gacatcccct tcgccttggc ggtgggagtc tgcgtctgcg 6900 
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tcctccttct ctgggtggac gacggtgggg acgacgacgc cccgggtgcc gcaagtccag 6960 

atctccgcca cggaggggtg caggcgctgc aggaggggac gcagctgccc gctgtccagg 7020 

gagtcgaggg aagtcgcgct gaggtcggcg ggaagcgttt gcaagttcac tttcagaaga 7080 

ccggtaagag cgtgagccag gtgcagatgg tacttgattt ccaggggggt gttggatgaa 7140 

gcgtccacgg cgtagaggag tccgtgtccg cgcggggcca ccaccgtgcc ccgaggaggt 7200 

tttatctcac tcgtcgaggg cgagcgccgg ggggtagagg cggctctgcg ccggggggca 7260 

gcggaggcag aggcacgttt tcgtgaggat tcggcagcgg ttgatgacga gcccggagac 7320 

tgctggcgtg ggcgacgacg cggcggttga ggtcctggat gtgccgtctc tgcgtgaaga 7380 

ccaccggccc ccgggtcctg aacctaaaga gagttccaca gaatcaatgt ctgcatcgtt 7440 

aacggcggcc tgcctgagga tctcctgcac gtcgcccgag ttgtcctgat aggcgatctc 7500 

ggccatgaac tgttccactt cttcctcgcg gaggtcaccg tggcccgctc gctccacggt 7560 

ggcggccagg tcgttggaga tgcggcgcat gagttgagag aaggcgttga ggccgttctc 7620 

gttccacacg cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg 7680 

ggccacgttg agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag 7740 

gtagttgagc gtggtggcga tgtgctcgca gacgaagaag tacataatcc agcgccgcag 7800 

ggtcatctcg ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac 7860 

ggcgaagttg aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg 7920 

gatgagatcg gcgaccgtgt cgcgcacctc ctgttcgaaa gcgccccgag gcgcctctgc 7980 

ttcttcctcc ggctcctcct cttccagggg ctcgggttcc tccggcagct ctgcgacggg 8040 

gacggggcgg cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc 8100 

gccgcgccgg cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc 8160 

gaagacgccg ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg 8220 
cagggacacg gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga 8280 
tctgagaacg tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc 8340 
gcaatcgcaa ggtaagctga gaacggtggg tcgctggggg gcgttcgcgg gcagttggga 8400 
ggtgatgctg ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag 8460 
gaggaccacg tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc 8520 
ctcgctctga cagcgacgca ggtctttgta gaagtcttgc atcagtctct ccaccggaac 
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ctctgcttct cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa 8640 

cgctaggtcg gccacgaccc tttcggccag cacggcctgt tgaatctgcg tgagggtggc 8700 

ctggaagtcg tccaggtcca cgaagcggtg ataggccccc gtgttgatgg tgtaggtgca 8760 

gttggccatg acggaccagt tgacgacttg catgccgggt tgggtgatct ccgtgtactt 8820 

gaggcgcgag taggccctgg actcgaacac gtagtcgttg catgtgcgca ccagatactg 8880 

gtagccgacc aggaagtgag gaggcggctc tcggtacagg ggccagccaa cggtggcggg 8940 

ggcgccgggg gacaggtcgt ccagcatgag gcggtggtag tggtagatgt agcgggagag 9000 

ccaggtgatg ccggccgagg tggttgcggc cctggtgaat tcgcggacgc ggttccagat 9060 

gttgcgcagg ggaccaaagc gctccatggt gggcacgctc tgccccgtga ggcgggcgca 9120 

atcttgtacg ctctagatgg aaaaaagaca gggcggtcat cgactccttt ccgtagcttg 9180 

gggggtaaag tcgcaagggt gcggcggcgg ggaaccccgg ttcgagaccg gccggatccg 9240 

ccgctcccga tgcgcctggc cccgcatcca cgacgtccgc gccgagaccc agccgcgacg 9300 

ctccgcccca atacggaggg gagtcttttg gtgttttttc gtagatgcat ccggtgctgc 9360 

ggcagatgcg accccagacg cccactacca ccgccgtggc ggcagtaaac ctgagcggag 9420 

gcggtgacag ggaggaggaa gagctggctt tagacctgga agagggagag gggctggccc 9480 

ggctgggagc gccatcccca gagagacacc ctagggttca gctcgtgagg gacgccaggc 9540 

aggcttttgt gccgaagcag aacctgttta gggaccgcag cggtcaggag gcggaggaga 9600 

tgcgcgattg caggtttcgg gcgggcagag agctcagggc gggcttcgat cgggagcggc 9660 

tcctgagggc ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc 9720 

acgtatcggc ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact 9780 

tccaaaagag ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg 9840 

ggctgatgca tctgtgggac ttcgtggagg cctacgtgca gaacccggct agcaaacccc 9900 

tgacggccca gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg 9960 

ccatgttgaa catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc 10020 

agagcatcgt ggtgcaggag aggggcctga gtttagcgga caaggtggcg gccattaact 10080 

attcgatgca gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc 10140 

ccatagacaa ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga 10200 

cgctgagcga cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca 10260 

gccgccggcg ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg 10320 
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gcgccgggga 


cgaggagcgc 


gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


10380 


ccagcgcgcg 


cgccttggag 


gcggcgggtt 


atcccgacga 


ggaggatcgg 


gacgatttgg 


10440 


aggaggcagg 


cgagtacgag 


gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


10500 


gccggcggac 


gggaccaccg 


cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


10560 


gggcgtgacc 


gcctccgatg 


actgggcggc 


ggccatggac 


cgcatcatgg 


cgctgaccac 


10620 


ccgcaacccc 


gaggctttta 


ggcagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


10680 


agcggtggtg 


ccgtcgcgca 


ccaacccgac 


gcacgagaaa 


gtcctgacta 


tcgtgaacgc 


10740 


cctggtagac 


agcaaggcca 


tccgccgtga 


cgaggcgggc 


ttgatttaca 


acgctctttt 


10800 


ggaacgcgtg 


gcgcgctaca 


acagcactaa 


cgtgcagacc 


aatctggacc 


gcctcaccac 


10860 


cgacgtgaag 


gaggcgctgg 


cgcagaagga 


gcggtttctg 


agggacagta 


atctgggctc 


10920 


tctggtggca 


ctgaacgcct 


tcctgagctc 


acagccggcc 


aacgtgcccc 


gcgggcagga 


10980 


ggattacgtg 


agcttcatca 


gcgctctgag 


actgctggtg 


tccgaggtgc 


cccagagcga 


11040 


ggtgtaccag 


tctgggccgg 


attacttttt 


ccagacgtcc 


cgacagggct 


tgcaaacggt 


11100 


gaacctgact 


caggccttta 


aaaacttgca 


aggcatgtgg 


ggggtcaagg 


ccccggtggg 


11160 


cgatcgcgcc 


actatctcca 


gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


11220 


cgcaccgttt 


accaacagta 


gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


11280 


tctgtaccgc 


gaggccatcg 


gccaggctca 


gatcgacgag 


catacgtatc 


aggagattac 


11340 


taacgtgagc 


cgtgccctgg 


gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


11400 


tttgctaacc 


aaccggaggc 


aaaaaatacc 


ctcccagttc 


acgttaagcg 


ccgaggagga 


11460 


gaggattctg 


cgatacgtgc 


agcagtccgt 


gagcctgtac 


ttgatgcgcg 


agggcgccac 


11520 


cgcttccacg 


gctttagaca 


tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


11580 


ccggccgttc 


attaaccgtc 


tgatggacta 


cttccatcgc 


gcggccgcca 


tgaacgggga 


11640 


gtacttcacc 


aatgccatcc 


tgaatccgca 


ttggatgccc 


ccgtccggct 


tctacaccgg 


11700 


ggagtttgac 


ctgcccgaag 


ccgacgacgg 


ctttctgtgg 


gacgacgtgt 


ccgatagcat 


11760 


tttcacgccg 


gctaatcgcc 


gattccagaa 


gaaggagggc 


ggagacgagc 


tccccctctc 


11820 


cagcgtggaa 


gcggcctcaa 


ggggagagag 


tccctttcca 


agtctgtctt 


ccgccagtag 


11880 


cggtcgggta 


acgcgtccac 


ggttgccggg 


ggagagcgac 


tacctgaacg 


accccttgct 


11940 


gcgaccggct 


agaaagaaaa 


attttcccaa 


taacggggtg 


gaaagcttgg 


tggataaaat 


12000 
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gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggcc 12060 

gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120 

tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180 

gcgctcgagt cgcctgtatt aataaaaaag acgcggaaac ttaccagagc catggccaca 12240 

gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg 12295 
* ' Met Arg Arg Ala 

1 

gtg aga gtc acg ccg gcg gcg tat gag ggc ccg ccc cct tct tac gaa 12343 
Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu 
5 10 15 20 

age gtg atg gga tea gcg aac gtg ccg gec acg ctg gag gcg cct tac 12391 
Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr 
25 30 35 



gtt cct ccc aga tac ctg gga cct acg gag ggc aga aac age ate cgt 
Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser lie Arg 
40 45 50 

tac tec gag ctg gcg ccc ctg tac gat ace acc aag gtg tac ctg gtg 
Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val 
55 60 65 

gac aac aag teg gcg gac ate gec tec ctg aat tac caa aac gat cac 
Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn Tyr Gin Asn Asp His 
70 75 80 

agt aac ttt ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg 
Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp Phe Thr Pro Thr 
85 90 95 100 



ggt cag ctg aaa acc ate ctg cac acc aac atg ccc aac ate aac gag 
Gly Gin Leu Lys Thr lie Leu His Thr Asn Met Pro Asn He Asn Glu 
120 125 130 

ttc atg tec acc aac aag ttc agg get aag ctg atg gta gaa aaa agt 
Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met Val Glu Lys Ser 
135 140 145 



12439 



12487 



12535 



12583 



gag gcg ggc acg cag acc att aac ttt gac gag cgt tec cgc tgg ggc 12631 
Glu Ala Gly Thr Gin Thr He Asn Phe Asp Glu Arg Ser Arg Trp Gly 
105 HO H5 



12679 



12727 



aat gcg gaa act egg cag ccc cga tac gag tgg ttc gag ttt acc att 12775 

Asn Ala Glu Thr Arg Gin Pro Arg Tyr Glu Trp Phe Glu Phe Thr He 
150 155 160 

cca gag ggc aac tat tec gaa act atg act ate gat etc atg aat aac 12823 

Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr He Asp Leu Met Asn Asn 

165 170 175 180 
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gcg ate gtg gac aat tac ctg caa gtg ggg aga cag aac ggg gtg ctg 12871 
Ala lie Val Asp Asn Tyr Leu Gin Val Gly Arg Gin Asn Gly Val Leu 
185 ~ 190 " 195 

gaa age gat ate ggc gtg aaa ttc gat ace aga aac ttc cga ctg ggg 12919 
Glu Ser Asp lie Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly 
200 205 210 

tgg gat ccc gtg ace aag ctg gtg atg cca ggc gtg tac ace aac gag 12967 
Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu 
215 220 225 

get ttt cac ccg gac ate gtg ctg ctg ccg ggg tgc ggt gtg gac ttc 13015 
Ala Phe His Pro Asp lie Val Leu Leu Pro Gly Cys Gly Val Asp Phe 
230 235 240 

act cag age cgt ttg agt aac ctg tta gga att aga aag cgc cgc ccc 13063 
Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly lie Arg Lys Arg Arg Pro 
245 250 255 " ^ 260 

ttc caa gag ggc ttt caa ate atg tat gag gac ctg gag gga ggt aat 13111 
Phe Gin Glu Gly Phe Gin lie Met Tyr Glu Asp Leu Glu Gly Gly Asn 
265 270 " 275 

ata ccc gec tta ctg gac gtg teg aag tac gaa get age ata caa cgc 13159 
lie Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala Ser He Gin Arg 
280 285 290 

gee aaa gcg gag ggt aga gag att egg gga gac ace ttt gcg gta get 13207 
Ala Lys Ala Glu Gly Arg Glu He Arg Gly Asp Thr Phe Ala Val Ala 
295 300 305 

ccc cag gac ctg gaa ata gtg cct tta act aaa gac age aaa gac aga 13255 
Pro Gin Asp Leu Glu He Val Pro Leu Thr Lys Asp Ser Lys Asp Arg 
310 315 320 

age tac aat att ata aac aac acg acg gac ace ctg tat egg age tgg 13303 
Ser Tyr Asn He He Asn Asn Thr Thr Asp Thr Leu Tyr Arg Ser Trp 
325 330 335 340 

ttt ctg get tac aac tac gga gac ccc gag aaa gga gtg aga tea tgg 13351 
Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp 
345 350 355 

acc ata etc ace ace acg gac gtg acc tgt ggc teg cag caa gtg tac 13399 
Thr He Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gin Gin Val Tyr 
360 365 370 

tgg tec ctg ccg gat atg atg caa gac ccg gtc acc ttc cgc ccc tec 13447 
Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Pro Ser 
375 380 385 

acc caa gtc age aac ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc 13495 
Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val 
390 395 400 
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cat gcc aag age ttc tac aac gag cag gec gtc tac teg caa ctt att 13543 
His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin Leu lie 
405 410 415 420 

cgc cag tec acc gcg ctt acc cac gtg ttc aat cgc ttt ccc gag aac 13591 
Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn 
425 430 435 

cag att ctg gtg cgc cct ccc get cct acc att acc acc gtc agt gaa 
Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu 
440 445 450 

aac gtt ccc gcc etc aca gat cac gga acc ctg ccg ctg cgc age agt 
Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser 
455 460 465 

ate agt gga gtt cag cgc gtg acc ate acc gac gcc aga cgt cga acc 13735 
He Ser Gly Val Gin Arg Val Thr He Thr Asp Ala Arg Arg Arg Thr 
470 475 480 



13639 



13687 



tgc ccc tac gtt tac aaa gcg ctt ggc gtg gtg get cct aaa gtt ctt 
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu 
485 490 495 500 



13783 



tct agt cgc acc ttc taa aaacatgtcc atcctcatct ctcccgataa 13831 
Ser Ser Arg Thr Phe 
505 



caacaccggc tggggactgg 


gctccggcaa 


gatgtaegge 


ggagccaaaa 


ggcgctccag 


13891 


tcagcaccca gttcgagttc 


ggggecaett 


ccgcgctcct 


tggggagctt 


acaagegagg 


13951 


actctegggt egaaeggctg 


tagacgatac 


catagatgee 


gtgattgccg 


acgcccgccg 


14011 


gtacaacccc ggaceggteg 


ctagcgccgc 


ctccaccgtg 


gattccgtga 


tegacagegt 


14071 


ggtagcegge gctcgggcct 


atgctcgccg 


caagaggegg 


ctgcatcgga 


gacgtcgccc 


14131 


caccgccgcc atgctggcag 


ccagggccgt 


getgaggegg 


geceggaggg 


caggcagaag 


14191 


ggctatgege cgcgctgccg 


ccaacgccgc 


egcegggagg 


gcccgccgac 


aggctgcccg 


14251 


ccaggctgcc gctgccatcg 


ctagcatggc 


cagacccagg 


agagggaacg 


tgtactgggt 


14311 


gcgtgattct gtgaegggag 


tccgagtgcc 


ggtgcgcagc 


cgacctcccc 


gaagttagaa 


14371 


gatccaagct gegaagaegg 


eggtactgag 


tctccctgtt 


gttatcagee 


caacatgagc 


14431 


aagcgcaagt ttaaagaaga 


actgetgeag 


acgctggtgc 


ctgagatcta 


tggccctccg 


14491 


gaegtgaage cagacattaa 


gccccgcgat 


ateaagegtg 


ttaaaaagcg 


ggaaaagaaa 


14551 


gaggaactcg cggtggtaga 


cgatggcgga 


gtggaattta 


ttaggagttt 


cgccccgcga 


14611 


cgcagggttc aatggaaagg 


gcggcgggta 


caacgcgttt 


tgaggccggg 


caccgcggta 


14671 


gtttttaccc egggagageg 


gtcggccgtt 


aggggtttca 


aaaggcagta 


cgacgaggtg 


14731 
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tacggcgacg aggacatatt ggaacaggcg gctcaacaga tcggagaatt tgcctacgga 14791 

aagcgttcgc gtcgcgaaga cctggccatc gccttagaca gcggcaaccc cacgcccagc 14851 

ctcaaacccg tgacgctgca gcaggtgctt cccgtgagcg ccagcacgga cagcaagagg 14911 

gggattaaga gagaaatgga agatctgcat cccaccatcc aactcatggt ccctaaacgg 14971 

cagaggctgg aagaggtcct ggagaagatg aaagtggacc ccagcataga gccggatgta 15031 

aaagtcagac ctattaagga agtggccccc ggtcttgggg tgcaaacggt ggacattcaa 15091 

atccccgtca ccaccgcttc aaccgccgtg gaagctatgg aaacgcaaac ggagacccct 15151 

gccgcgatcg gtaccaggga agtggcgttg caaacggagc cttggtacga atacgcagcc 15211 

cctcggcgtc agaggcgttc cgctcgttac ggccccgcca acgccatcat gccagaatat 15271 

gcgctgcatc cgtctattct gcccactccc ggataccggg gtgtgacgta tcgcccgtct 15331 

ggaacccgcc gccgaacccg tcgccgccgc cgctcccgtc gcgctctggc ccccgtgtcg 15391 

gtgcggcgtg tgacccgccg gggaaagaca gtcgtcattc ccaacccgcg ttaccaccct 15451 

agcatccttt aataactctg ccgttttgca gatggctctg acttgccgcg tgcgccttcc 15511 

cgttccgcac tatcgaggaa gatctcgtcg taggagaggc atgacgggca gtggtcgccg 15571 

gcgggctttg cgcaggcgca tgaaaggcgg aattttaccc gccctgatac ccataattgc 15631 

cgccgccatc ggtgccatac ccggcgttgc ttcagtggcg ttgcaagcag ctcgtaataa 15691 

ataaacaaag gcttttgcac ttatgacctg gtcctgacta ttttatgcag aaagagcatg 15751 

gaagacatca attttacgtc gctggctccg cggcacggct cgcggccgct catgggcacc 15811 

tggaacgaca tcggcaccag tcagctcaac gggggcgctt tcaattgggg gagcctttgg 15871 

agcggcatta aaaactttgg ctccacgatt aaatcctacg gcagcaaagc ctggaacagt 15931 

agtgctggtc agatgctccg agataaactg aaggacacca acttccaaga aaaagtggtc 15991 

aatggggtgg tgaccggcat ccacggcgcg gtagatctcg ccaaccaagc ggtgcagaaa 16051 

gagattgaca ggcgtttgga aagctcgcgg gtgccgccgc agagagggga tgaggtggag 16111 

gtcgaggaag tagaagtaga ggaaaagctg cccccgctgg agaaagttcc cggtgcgcct 16171 

ccgagaccgc agaagcggcc caggccagaa ctagaagaga ctctggtgac ggagagcaag 16231 

gagcctccct cgtacgagca agccttgaaa gagggcgcct ctccaccctc ctacccgatg 16291 

actaagccga tcgcacccat ggctcgaccg gtgtacggca aggattacaa gcccgtcacg 16351 

ctagagctgc ccccaccgcc ccccacgcgc ccgaccgtcc cccccctgcc gactccgtcg 16411 
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gcggccgcgg cgggacccgt gtccgcacca tccgctgtgc ctctgccagc cgcccgtcca 16471 

gtggccgtgg ccactgccag aaaccccaga ggccagagag gagccaactg gcaaagcacg 16531 

ctgaacagca tcgtgggcct gggagtgaaa agcctgaaac gccgccgttg ctattattaa 16591 

aaaagtgtag ctaaaaagtc tcccgttgta tacgcctcct atgttaccgc cagagacgag 16651 

tgactgtcgc cgcgagcgcc gctttcaag atg gcc acc cca teg atg atg ccg 
9 y * • Met Ala Thr Pro Ser Met Met Pro 



16704 



510 



16752 



16800 



16848 



16896 



cag tgg tct tac atg cac ate gcc ggc cag gac gcc teg gag tac ctg 
Gin Trp Ser Tyr Met His lie Ala Gly Gin Asp Ala Ser Glu Tyr Leu 
515 520 525 

agt ccc ggc etc gtg cag ttt gcc cgc gcc acc gac acc tac ttc age 
Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser 
530 535 540 545 

ttg gga aac aag ttt aga aac ccc acc gtg gcc ccc acc cac gat gtg 
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val 
550 " 555 560 

acc acg gac cgc teg cag agg ctg acc ctg cgc ttt gtg ccc gta gac 
Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe Val Pro Val Asp 
565 570 575 

egg gag gac acc gcg tac tct tac aaa gtg cgc tac acg ttg gcc gta 16944 
Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val 
580 585 590 

ggg gac aac cga gtg ctg gac atg gcc age acc tac ttt gac ate egg 
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp lie Arg 
595 600 60S 

ggg gtg ctg gat egg ggt ccc age ttc aag ccc tat tec ggc acc get 
Glv Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala 
610 615 • 620 625 

tac aac tec ctg gcc ccc aag gga get ccc aac ccc teg gaa tgg acg 
Tvr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro Ser Glu Trp Thr 
* 630 635 640 

gac act tec gac aac aaa ctt aaa gca tat get cag get ccc tac cag 
Asp Thr Ser Asp Asn Lys Leu Lys Ala Tyr Ala Gin Ala Pro Tyr Gin 
645 650 655 

agt caa gga ctt aca aag gat ggt att cag gtt ggg eta gtt gtg aca 
Ser Gin Gly Leu Thr Lys Asp Gly He Gin Val Gly Leu Val Val Thr 
660 665 670 

gag tea gga caa aca ccc caa tat gca aac aaa gtg tac caa ccc gag 
Glu Ser Gly Gin Thr Pro Gin Tyr Ala Asn Lys Val Tyr Gin Pro Glu 
675 680 685 



16992 



17040 



17088 



17136 



17184 



17232 
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cca caa att ggg gaa aac caa tgg aat tta gaa caa gaa gat aaa gcg 17280 
Pro Gin lie Gly Glu Asn Gin Trp Asn Leu Glu Gin Glu Asp Lys Ala 
690 695 700 705 

gcg gga aga gtc eta aag aaa gat acc cct atg ttt ccc tgc tat ggg 17328 
Ala Gly Arg Val Leu Lys Lys Asp Thr Pro Met Phe Pro Cys Tyr Gly 
710 715 720 

tea tat gec agg ccc aca aac gaa caa gga ggg cag gca aaa aac caa 17376 
Ser Tyr Ala Arg Pro Thr Asn Glu Gin Gly Gly Gin Ala Lys Asn Gin 
725 730 735 

gaa gta gat tta cag ttt ttt gec act ccg ggc gac acc cag aac acg 17424 
Glu Val Asp Leu Gin Phe Phe Ala Thr Pro Gly Asp Thr Gin Asn Thr 
740 745 750 

get aaa gtg gta ctt tat get gaa aat gtc aac ctg gaa act cca gat 17472 
Ala Lys Val Val Leu Tyr Ala Glu Asn Val Asn Leu Glu Thr Pro Asp 
755 760 765 

act cac tta gtg ttt aaa ccc gat gac gac age acc agt tea aaa ctt 17520 
Thr His Leu Val Phe Lys Pro Asp Asp Asp Ser Thr Ser Ser Lys Leu 
770 775 780 785 

ctt ctt ggg cag cag get gca cct aac aga ccc aac tac ata ggt ttt 17568 
Leu Leu Gly Gin Gin Ala Ala Pro Asn Arg Pro Asn Tyr He Gly Phe 
790 795 800 

aga gat aat ttt att ggt tta atg tac tac aat age act gga aac atg 17616 
Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met 
805 810 815 

ggc gtg ctg gee gga cag get tct caa ttg aat gec gta gtc gac ttg 17664 
Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala Val Val Asp Leu 
820 825 830 

cag gac aga aac acc gag ttg tec tac cag ctg atg ctg gac gca ctg 17712 
Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met Leu Asp Ala Leu 
835 840 845 

ggg gat cgc age cga tat ttt tea atg tgg aat cag gca gta gac age 17760 
Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin Ala Val Asp Ser 
850 855 860 865 

tat gac cca gac gtt aga att ata gaa aac cac gga gtg gaa gac gaa 17808 
Tyr Asp Pro Asp Val Arg lie He Glu Asn His Gly Val Glu Asp Glu 
870 875 880 

ctg cca aac tat tgt ttt cct ctg gga gga atg gtg gtg act gac aat 17856 
Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Val Val Thr Asp Asn 
885 890 895 

tac aac tct gtg acg cct caa aat gga ggc agt gga aat aca tgg cag 17904 
Tyr Asn Ser Val Thr Pro Gin Asn Gly Gly Ser Gly Asn Thr Trp Gin 
900 905 910 
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gca gac aat act aca ttt agt caa aga gga gcg cag att ggc tec gga 
Ala Asp Asn Thr Thr Phe Ser Gin Arg Gly Ala Gin He Gly Ser Gly 
915 920 925 

aac atg ttt gec ctg gaa att aac eta cag gec aac etc tgg cgc ggc 
Asn Met Phe Ala Leu Glu He Asn Leu Gin Ala Asn Leu Trp Arg Gly 
930 935 940 945 

ttc ttg tat tec aat att ggg ttg tat ctt cca gac tct ctg aaa ate 
Phe Leu Tyr Ser Asn He Gly Leu Tyr Leu Pro Asp Ser Leu Lys He 
950 955 960 

acc ccc gac aac ate acg ctg cca gaa aac aaa aac act tat cag tac 
Thr Pro Asp Asn He Thr Leu Pro Glu Asn Lys Asn Thr Tyr Gin Tyr 
965 970 975 

atg aac ggt cgc gta acg cca ccc ggg etc ata gac acc tat gta aac 
Met Asn Gly Arg Val Thr Pro Pro Gly Leu He Asp Thr Tyr Val Asn 
980 985 990 



gtg ggc gcg cgc tgg tec ccc gat gtc atg gac age 
Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser 
995 1000 1005 



att aac ccc ttc 
He Asn Pro Phe 



aac cac cac cgt aac gcg ggc ttg cgc tac cgc tec atg etc ttg 
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu 
1010 1015 1020 



ggc aac ggc cgt tat gtg 
Gly Asn Gly Arg Tyr Val 
1025 1030 



cct ttt cac att cag gtg ccc caa aaa 
Pro Phe His lie Gin Val Pro Gin Lys 
1035 



ttc ttt gec att aaa aac ctg ctg ctt etc ccc ggt tec tat acc 
Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr 
1040 ^ 1045 1050 



ate ctg cag age 
He Leu Gin Ser 



tat gag tgg aac ttc cgc aag gat gtc aac atg 
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 
1055 1060 1065 

teg ctg ggt aat gac ctg cga gtg gac ggg gee age ata cgc ttt 
Ser Leu Gly Asn Asp Leu • Arg Val Asp Gly Ala Ser He Arg Phe 
1070 1075 1080 

gac age att aac ctg tat gec aac ttt ttt ccc atg gec cac aac 
Asp Ser He Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn 
1085 1090 1095 

acg gee tct acc ctg gaa gee atg ctg cgc aac gac acc aat gac 
Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp 
1100 1105 HIO 

cag tec ttc aac gac tac ctg tgc gcg get aac atg ctg tac ccc 
Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro 
1115 1120 H25 



17952 



18000 



18048 



18096 



18144 



18192 



18237 



18282 



18327 



18372 



18417 



18462 



18507 



18552 
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ate ccc gec aac gec acc age gtg ccc att tct att cct tct egg 

He Pro Ala Asn Ala Thr Ser Val Pro He Ser He Pro Ser Arg 

1130 1135 1140 

aac tgg get gec ttc agg ggc tgg agt ttt act cgc etc aaa acc 

Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr 

1145 1150 1155 

aag gag act ccc teg ctg ggc tec ggt ttt gac ccc tac ttt gtt 

Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val 

1160 1165 1170 



tac tec ggc tec att ccc 
Tyr Ser Gly Ser He Pro 
1175 



tac eta gat ggc acc ttt tac etc aac 
Tyr Leu Asp Gly Thr Phe Tyr Leu Asn 

1180 *" lies 



cac act ttc aaa aag gtg tct att atg ttt gac tec teg gtt age 
His Thr Phe Lys Lys Val Ser He Met Phe Asp Ser Ser Val Ser 
1190 1195 1200 



tgg ccc ggc aac gac cgc ctg eta acg ccc aac 
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 
1205 " 1210 1215 



gag ttc gaa att 
Glu Phe Glu He 



aag cgt tec gtg gac ggt 
Lys Arg Ser Val Asp Gly 
1220 1225 



gaa ggg tac aac gtg gee cag age aac 
Glu Gly Tyr Asn Val Ala Gin Ser Asn 
1230 



atg acc aag gac tgg ttt . eta att caa atg etc agt cac tat aat 

Met Thr Lys Asp Trp Phe Leu He Gin Met Leu Ser His Tyr Asn 
1235 ~ 1240 1245 

ata ggt tac cag ggc ttc tat gtg ccc gag aac tac aag gac cgc 

He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp Arg 
1250 1255 1260 



atg tac tec ttc ttc cgc aac ttc caa cca atg 
Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met 
1265 1270- 1275 



age egg cag gtg 
Ser Arg Gin Val 



18597 



18642 



18687 



18732 



18777 



18822 



18867 



18912 



18957 



19002 



gta gat acc gtg act tat aca gac tac aaa gat gtc aag etc ccc 19047 
Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys Asp Val Lys Leu Pro 
1280 1285 1290 

tac caa cac aac aac tea ggg ttc gtg ggc tac atg gga ccc acc 19092 
Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
1295 1300 1305 

atg cga gag gga cag gee ' tac ccg gec aac tat ccc tac ccc ctg 19137 
Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
1310 " 1315 1320 

ate gga gag act gec gta ccc age etc acg cag aaa aag ttc etc 19182 
He Gly Glu Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu 
1325 " 1330 1335 
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tgc gac egg gtg atg tgg .agg ata ccc ttc tct age aac ttt atg 19227 
Cys Asp Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe Met 
1340 1345 1350 

teg atg ggc tec etc acc gac ctg ggg cag aac atg ctg tac gcc 19272 
Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 
1355 " 1360 1365 

aac tec get cac gcc ttg gac atg act ttt gag gtg gat ccc atg 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 
1370 1375- 1380 

gat gag ccc acg ctt etc tat gtt ctg ttt gaa gtc ttc gac gtg 
Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val 
1385 1390 1395 

gtg cgc ate cac cag ccg cac cgc ggc gtc ate gag gcc gtc tac 
Val Arg He His Gin Pro His Arg Gly Val He Glu Ala Val Tyr 
1400 1405 1410 

ctg cgc aca cct ttc tct ' gcc ggt aac gcc acc acc taa agaagctgat 19456 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1415 1420 1425 

gggttccagc gaacaggagt tgeaggecat tgttcgcgac ctgggctgcg ggccctgctt 19516 

tttgggcacc ttcgacaagc gttttccegg attcatgtcc ccccacaagc cggcctgcgc 19576 

categttaac acggccggac gggagacagg gggggtgcac tggctcgcct tcgcctggaa 19636 

cccgcgcaac cgcacctgct acctgttcga cccttttggt ttctccgacg aaaggctgaa 19696 

gcagatctac caattcgagt acgaggggct cctcaagcgc agegctctgg cctccacgcc 19756 

cgaccactgc gtcaccctgg aaaagtccac ccagacggtc caggggcccc tctcggccgc 19816 

ctgegggett ttctgttgca tgtttttgca cgccttcgtg cactggcctc acacccccat 19876 

ggagegcaac cccaccatgg atctgctcac cggagtgccc aacagcatgc ttcacagtcc 19936 

ccaggtcgcc cccaccctgc gtegcaatea ggaccacctg tategcttte tggggaa'aca 19996 

ctctgcctat ttccgccgcc accggcagcg catcgaacag gccacggcct tcgaaagcat 20056 

gagecaaaga gtgtaatcaa taaaaaccgt ttttatttga catgataege gcttctggcg 20116 

tttttattaa aaatcgaagg gttcgaggga ggggtcctcg tgcccgctgg ggagggacac 20176 

gttgeggtae tggaatcggg cgctccaacg aaactcgggg atcaccagcc geggcaggge 20236 

cacgtcttcc atgttctgct tccaaaactg tcgcaccagc tgeagggetc ccatcacgtc 20296 

gggegctgag atcttgaagt cgcagttagg gccggagccc ccgcggctgt tgeggaacac 20356 

ggggttggca cactggaaca ccaacacgct ggggttgtgg atactageca gggcegtegg 20416 

gtcggtcacc tecgatgeat ccagatcctc ggcattgetc agggegaacg gggtcagctt 20476 

166 



WO 03/046124 



PCT/US02/33645 



gcacatctgc cgcccgatct ggggtaccag gtcgcgcttg ttgaggcagt cgcagcgcag 20536 

agggatgagg atgcgacgct gcccgcgttg catgatgggg taactcgccg cca^rgaactc 20596 

ctctatctga cggaaggcca tctgggcctt gacgccctcg gtgaaaaata gcccacagga 20656 

cttgctggaa aacacgttat tgccacagtt gatgtcttcc gcgcagcagc gcgcatcttc 20716 

gttcttcagc tgaaccacgt tgcgacccca gcggttctga accaccttgg ctttcgtggg 20776 

atgctccttc agcgcccgct gtccgttctc gctggtcaca tccatttcca ccacgtgctc 20836 

cttgcagacc atctccactc cgtggaaaca gaacagaatg ccctcctgtt gggtattgcg 20896 

atgctcccac acggcgcacc cggtggactc ccagctcttg tgtttcaccc ccgcgtaggc 20956 

ttccatgtaa gccattagaa atctgcccat cagctcagtg aaggtcttct ggttggtgaa 21016 

ggttagcggc aggccgcggt gttcctcgtt caaccaagtt tgacagatct tgcggtacac 21076 

ggctccctgg tcgggcagaa acttaaaagt cgttctgctc tcgttgtcca cgtggaactt 21136 

ctccatcaac atcgtcatga cttccatgcc cttctcccag gcagtcacca gcggcgcgct 21196 

ctcggggttc ttcaccaaca cggcggtgga ggggccctcg ccggccccga cgtccttcat 21256 

ggacattttt tgaaactcca cggtgccgtc cgcgcggcgt actctgcgca tcggagggta 21316 

gctgaagccc acctccatga cggtgctttc gccctcgctg tcggagacga tctccgggga 21376 

gggcggcgga acgggggcag acttgcgagc cttcttcttg ggagggagcg gaggcacctc 21436 

ctgctcgcgc tcgggactca tctcccgcaa gtagggggtg atggagcttc ctggttggtt 21496 

ctgacggttg gccattgtat cctaggcaga aagacatgga gcttatgcgc gaggaaactt 21556 

taaccgcccc gtcccccgtc agcgacgaag aggtcatcgt cgaacaggac ccgggctacg 21616 

ttacgccgcc cgaggatctg gaggggccct tagacgaccg gcgcgacgct agtgagcggc 21676 

aggaaaatga gaaagaggag gaggagggct gctacctcct ggaaggcgac gttttgctaa 21736 

agcatttcgc caggcagagc accatactca aggaggcctt gcaagaccgc tccgaggtgc 21796 

ccttggacgt cgccgcgctc tcccaggcct acgaggcgaa ccttttctcg ccccgagtgc 21856 

ctccgaagag acagcccaac ggcacctgcg agcccaaccc gcgactcaac ttctaccccg 21916 

tgttcgccgt gcccgaggcg ctggccacct accacatctt tttcaaaaac cagcgcattc 21976 

ccctttcctg ccgggccaac cgcaccgcgg ccgataggaa gctaacactc agaaacggag 22036 

tcagcatacc tgatatcacg tcactggagg aagtgcctaa gatcttcgag ggtctgggtc 22096 

gagatgagaa gcgggcggcg aacgctctgc agaaagaaca gaaagagagt cagaacgtgc 221S6 
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tggtggagct ggagggggac aacgcgcgtc tgaccgtcct caaacgttgc atagaagttt 22216 
cccacttcgc ctacccggcc ctcaacctgc cgcccaaagt tatgaaatcg gtcatggacc 22276 
agctactcat caagagagct gagcccctga atcccgacca ccctgaggcg gaaaactcag 22336 
aggacggaaa gcccgtcgtc agcgacgagg agctcgagcg gtggctggaa accagggacc 22396 
cccagcagtt gcaagagagg cgcaagatga tgatggcggc cgtgctggtc acggtggagc 22456 
tagaatgcct gcaacggttt ttcagcgacg tggagacgct acgcaaaatc ggggagtccc 22516 
tgcactacac cttccgccag ggctacgttc gccaggcctg caaaatctcc aacgtagagc 22576 
tcagcaacct ggtttcctac atgggcatcc tccacgagaa ccggctgggg cagagcgtgc 22636 
tgcactgcac cttgcaaggc gaggcgcgaa gggactacgt ccgagactgc gtctacctct 22696 
tcctcaccct cacctggcag accgccatgg gcgtgtggca gcagtgcttg gaagagagaa 22756 
acctcaaaga gctggacaaa ctcctctgcc gccagcggcg ggccctctgg accggcttca 22816 
gcgagcgcac ggtcgcctgc gccctggcag acatcatttt cccagaacgc ctgatgaaaa 22876 
ccttgcagaa cggcctgccg gatttcatca gtcagagcat cttgcaaaac ttccgctcct 22936 
tcgtcctgga gcgctccggg atcttgcccg ccatgagctg cgcgctgcct tctgactttg 22996 
tccccctttc ctaccgcgag tgccctcccc cactgtggag ccactgctac ctcttccaac 23056 
tggccaactt tctggcctac cactccgacc tcatggaaga cgtgagcgga gaggggctgc 23116 
tcgagtgcca ctgccgctgc aacctctgca ccccccacag atcgctggcc tgcaacaccg 23176 
agctgctcag cgaaacccag gtcataggta ccttcgagat ccaggggccc cagcagcaag 23236 
agggtgcttc cggcttgaag ctcactccgg cgctgtggac ctcggcttac ttacgcaaat 23296 
ttgtagccga ggactaccac gcccacaaaa ttcagtttta cgaagaccaa tctcgaccac 23356 
cgaaagcccc cctcacggcc tgcgtcatca cccagagcaa aatcctggcc caattgcaat 23416 
ccatcaacca agcgcgccga gatttccttt tgaaaaaggg tcggggggtg tacctggacc 23476 
cccagaccgg cgaggaactc aacccgtcca cactttccgt cgaagcagcc cccccgagac 23536 
atgccaccca agggaaccgc caagcagctg atcgctcggc agagagcgaa gaagcaagag 23596 
ctgctccagc agcaggtgga ggacgaggaa gagctgtggg acagccaggc agaggaggtg 23656 
tcagaggacg aggaggagat ggaaagctgg gacagcctag acgaggagga cgagctttca 23716 
gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct gagccgacag 23776 
ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg taggtgggac 23836 
gccaccggat ctccagcggc agcggcaacg gcagcgggta aggccaaacg cgagcggcgg 23896 
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gggtattgct cctggcggac ccacaaaagc agtatcgtga actgcttgca acactgcggg 23956 

ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc cttccctcgc 24016 

aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg agaaaaaagc 24076 

taaggcctcc tctgccgcga ggaaaaactc cgccgccgct gccgccaagg atccgccggc 24136 

caccgaggag ctgagaaagc gcatctttcc cactctgtat gctatctttc agcaaagccg 24196 

cgggcagcac cctcagcgcg aactgaaaat aaaaaaccgc tccttccgct cactcacccg 24256 

cagctgtctg taccacaaga gagaagacca gctgcagcgc accctggacg acgccgaagc 24316 

actgttcagc aaatactgct cagcgtctct taaagactaa aagacccgcg ctttttcccc 24376 

ctcgggcgcc aaaacccacg tcatcgccag catgagcaag gagattccca ccccttacat 24436 

gtggagctat cagccccaga tgggcctggc cgcgggggcc gcccaggact actccagcaa 24496 

aatgaactgg ctcagcgccg gcccccacat gatctcacga gttaacggca tccgagccca 24556 

ccgaaaccag atcctcttag aacaggcggc aatcaccgcc acaccccggc gccaactcaa 24616 

cccgcccagt tggcccgccg cccaggtgta tcaggaaact ccccgcccga ccacagtcct 24676 

cctgccacgc gacgcggagg ccgaagtcct catgactaac tctggggtac aattagcggg 24736 

cgggtccagg tacgccaggt acagaggtcg ggccgctcct tactctcccg ggagtataaa 24796 

gagggtgatc attcgaggcc gaggtatcca gctcaacgac gaggcggtga gctcctcaac 24856 

cggtctcaga cctgacggag tcttccagct cggaggagcg ggccgctctt ccttcaccac 24916 

tcgccaggcc tacctgaccc tgcagagctc ttcctcgcag ccgcgctccg ggggaatcgg 24976 

cactctccag ttcgtggaag agttcgtccc ctccgtctac ttcaacccgt tttccggctc 25036 * 

acctggacgc tacccggacg ccttcattcc caactttgac gcagtgagtg aatccgtgga 25096 

cggctacgac tgatgacaga tggtgcggcc gtgagagctc ggctgcgaca tctgcatcac 25156 

tgccgccagc ctcgctgcta cgctcgggag gcgatcgtgt tcagctactt tgagctgccg 25216 

gacgagcacc ctcagggacc ggctcacggg ttgaaactcg agattgagaa cgcgcttgag 25276 

tctcacctca tcgacgcctt caccgcccgg cctctcctgg tagaaaccga acgcgggatc 25336 

actaccatca ccctgttctg catctgcccc acgcccggat tac atg aag ate tgt 25391 

Met Lys He Cys 
1430 

gtt gtc ate ttt gcg etc agt tta ata aaa act gaa ctt ttt gcc 25436 
Val Val He Phe Ala Leu Ser Leu He Lys Thr Glu Leu Phe Ala 
1435 1440 1445 
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gta cct tea acg cca cgc gtt gtt tct cct tgt gaa aaa acc cca 25481 
Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys Thr Pro 
1450 1455 1460 

gga gtc ctt aac tta cac ata gca aaa ccc ttg tat ttt acc ata 25526 
Gly Val Leu Asn Leu His He Ala Lys Pro Leu Tyr Phe Thr He 
1465 1470 1475 

gaa aaa caa eta gec ctt tea att gga aaa ggg tta aca att tct 25571 
Glu Lys Gin Leu Ala Leu Ser He Gly Lys Gly Leu Thr He Ser 
1480 1485 1490 

get aca gga cag ttg gaa age aca gca age gta cag gac age get 25616 
Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala 
1495 1500 1505 

aca cca ccc eta cgt ggt att tec cct tta aag ctg aca gac aac 25661 
Thr Pro Pro Leu Arg Gly He Ser Pro Leu Lys Leu Thr Asp Asn 
1510 1515 1520 

ggt tta aca tta age tat tea gat ccc ctg cgt gtg gta ggt gac 25706 
Gly Leu Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp 
1525 1530 1535 

caa ctt acg ttt aat ttt act tct cca eta cgt tac gaa aat ggc 25751 
Gin Leu Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly 
1540 1545 1550 

agt ctt aca ttc aac tac act tct ccc atg aca eta ata aac aac 25796 
Ser Leu Thr Phe Asn Tyr Thr Ser Pro Met Thr Leu He Asn Asn 
1555 1560 1565 

agt ctt get att aac gtc aat acc tec aaa ggc etc agt agt gac 25841 
Ser Leu Ala He Asn Val Asn Thr Ser Lys Gly Leu Ser Ser Asp 
1570 1575 1580 

aac ggc aca etc get gta aat gtt act cca gat ttt aga ttt aac 25886 
Asn Gly Thr Leu Ala Val Asn Val Thr Pro Asp Phe Arg Phe Asn 
1585 1590 1595 

age tct ggt gee tta act ttt ggc ata caa agt eta tgg act ttt 25931 
Ser Ser Gly Ala Leu Thr Phe Gly He Gin Ser Leu Trp Thr Phe 
1600 1605 1610 

cca acc aaa act cct aac. tgt acc gtg ttt acc gaa agt gac tec 25976 
Pro Thr Lys Thr Pro Asn Cys Thr Val Phe Thr Glu Ser Asp Ser 
1615 1620 1625 

ctg ctg agt ctt tgc ttg act aaa tgc gga get cac gta ctt gga 26021 
Leu Leu Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly 
1630 1635 1640 

age gtg agt tta age gga gtg gca gga acc atg eta aaa atg acc 26066 
Ser Val Ser Leu Ser Gly Val Ala Gly Thr Met Leu Lys Met Thr 
1645 1650 1655 
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cac act tct gtt acc gtt cag ttt teg ttt gat gac agt ggt aaa 
His Thr Ser Val Thr Val Gin Phe Ser Phe Asp Asp Ser Gly Lys 
1660 1665 1670 



eta ata ttc tct cca 
Leu lie Phe Ser Pro 
1675 



ctt gcg aac aac act tgg ggt gtt cga caa 
Leu Ala Asn Asn Thr Trp Gly Val Arg Gin 
1680 1685 



age gag agt ccg ttg ccc aac cca tec ttc aac get etc acg ttt 
Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu Thr Phe 
1690 1695 1700 

atg cca aac agt acc att tat tct aga gga gca agt aac gaa cct 
Met Pro Asn Ser Thr lie Tyr Ser Arg Gly Ala Ser Asn Glu Pro 
1705 1710 1715 

caa aac aat tat tat gtc cag acg tat ctt aga ggc aac gtg cga 
Gin Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Arg 
1720 1725 1730 

aag cca att eta eta act gtt acc tac aac tea gtt aat tea gga 
Lys Pro He Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly 
1735 1740 1745 

tat tec tta act ttt aaa tgg gat get gtc gee aat gaa aaa ttt 
Tyr Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe 
1750 1755 1760 

gee act cct aca tct teg ttt tgc tat gtt gca gag caa taa 
Ala Thr Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gin 
1765 1770 

aaccctgtta ccccaccgtc tcgttttttt cag atg aaa cga gcg aga gtt 

Met Lys Arg Ala Arg Val 
1775 



26111 



26156 



26201 



26246 



26291 



26336 



26381 



26423 



26474 



gat gaa gac ttc aac cca ■ gtg tac cct tat gac ccc cca tac get 
Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro Tyr Ma 
1780 " 1785 1790 

ccc gtc atg ccc ttc att act ccg cct ttt acc tec teg gat ggg 
Pro Val Met Pro Phe He Thr Pro Pro Phe Thr Ser Ser Asp Gly 
1795 1800 1805 

ttg cag gaa aaa cca ctt gga gtg tta agt tta aac tac agg gat 
Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp 
1810 181$ 1820 

ccc att act aca caa aat ggg tct etc acg tta aaa eta gga aac 
Pro He Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu Gly Asn 
1825 1830 1835 



ggc 
Gly 
1840 



etc act eta aac aac 
Leu Thr Leu Asn Asn 
1845 



cag gga cag tta aca tea act get ggc 
Gin Gly Gin Leu Thr Ser Thr Ala Gly 
1850 



26519 



26564 



26609 



26654 



26699 
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gaa gtg gag cct ccg etc act aat get aac aac aaa ctt gca eta 
Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
1855 1860 1865 



26744 



gee tat age gaa cca tta gca gta aaa age aac cgc eta act eta 
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu 
1870 1875 1880 



26789 



tea cac ace get ccc ctt gtc ate get aat aat tct tta gcg ttg 
Ser His Thr Ala Pro Leu Val lie Ala Asn Asn Ser Leu Ala Leu 
1885 1890 1895 



26834 



caa gtt tea gag cct att ttt gta aat gac gat gac aag eta gee 
Gin Val Ser Glu Pro He Phe Val Asn Asp Asp Asp Lys Leu Ala 
1900 1905 1910 



26879 



ctg cag aca gee gee ccc ctt gta ace aac get ggc acc ctt cgc 
Leu Gin Thr Ala Ala Pro- Leu Val Thr Asn Ala Gly Thr Leu Arg 
1915 1920 1925 



26924 



tta cag age get gec cct tta gga ttg gtt gaa aat act ctt aaa 
Leu Gin Ser Ala Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys 
1930 1935 ~ 1940 



26969 



ctg ctg ttt tct aaa ccc ttg tat ttg caa aat gat ttt ctt gca 
Leu Leu Phe Ser Lys Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala 
1945 1950 1955 



27014 



tta gee att gaa cgc ccc 
Leu Ala He Glu Arg Pro 
1960 1965 



ctg get gta gca gee gca ggt act ctg 
Leu Ala Val Ala Ala Ala Gly Thr Leu 
1970 



27059 



acc eta caa ctt act cct cca tta aag act aac gat gac ggg eta 
Thr Leu Gin Leu Thr Pro Pro Leu Lys Thr Asn Asp Asp Gly Leu 
1975 1980 1985 



27104 



aca eta tec aca gtc gag. cca tta act gta aaa aac gga aac eta 
Thr Leu Ser Thr Val Glu Pro Leu Thr Val Lys Asn Gly Asn Leu 
1990 1995 2000 



27149 



ggc ttg caa ata teg cgc cct tta gtt gtt caa aac aac ggc ctt 
Gly Leu Gin He Ser Arg Pro Leu Val Val Gin Asn Asn Gly Leu 
2005 2010 2015 



27194 



teg ctt get att acc ccc ccg ctg cgt ttg ttt aac age gac ccc 
Ser Leu Ala He Thr Pro Pro Leu Arg Leu Phe Asn Ser Asp Pro 
2020 2025 2030 



27239 



gtt ctt ggt ttg ggc ttc act ttt ccc eta get gtc aca aac aac 
Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala Val Thr Asn Asn 
2035 2040 2045 



27284 



etc etc tec tta aac atg gga gac gga gtt aaa ctt acc tat aat 
Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu Thr Tyr Asn 
2050 2055 2060 



27329 
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aaa eta aca gec aat ttg ggt agg gat tta caa ttt gaa aac ggt 
Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu Asn Gly 
2065 2070 2075 



27374 



gcg att gec gta acg ctt * act gec gaa tta cct ttg caa tac act 
Ala lie Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr Thr 
2080 2085 2090 



27419 



aac aaa ctt caa ctg aat att gga get ggc ctt cgt tac aat gga 
Asn Lys Leu Gin Leu Asn lie Gly Ala Gly Leu Arg Tyr Asn Gly 
2095 2100 2105 



27464 



gee age aga aaa eta gat gta aac att aac caa aat aaa ggc tta 
Ala Ser Arg Lys Leu Asp Val Asn lie Asn Gin Asn Lys Gly Leu 
2110 2115 2120 



27509 



act tgg gac aac gat gca gtt att ccc aaa eta 
Thr Trp Asp Asn Asp Ala Val lie Pro Lys Leu 
2125 2130 2135 



gga teg ggc tta 
Gly Ser Gly Leu 



27554 



caa ttt gac cct aat ggc aac ate get gtt ate 
Gin Phe Asp Pro Asn Gly Asn lie Ala Val He 
2140 2145 2150 



cct gaa acc gtg 
Pro Glu Thr Val 



27599 



aag ccg caa acg tta tgg acg act gca gat ccc teg cct aac tgc 
Lys Pro Gin Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys 
2155 2160 2165 



27644 



tea gtg tac cag gac ttg 
Ser Val Tyr Gin Asp Leu 
2170 2175 



gat gee agg ctg tgg etc get ctt gtt 
Asp Ala Arg Leu Trp Leu Ala Leu Val 
2180 



27689 



aaa agt ggc gac atg gtg cat gga age att gec eta aaa gec eta 27734 

Lys Ser Gly Asp Met Val His Gly Ser lie Ala Leu Lys Ala Leu 

2185 2190 2195 

aaa ggg acg ttg eta aat cct aca gee age tac att tec att gtg 27779 

Lys Gly Thr Leu Leu Asn Pro Thr Ala Ser Tyr He Ser He Val 

2200 2205 2210 

ata tat ttt tac age aac gga gtc agg cgt acc aac tat cca acg 27824 

He Tyr Phe Tyr Ser Asn Gly Val Arg Arg Thr Asn Tyr Pro Thr 

2215 2220 2225 

ttt gac aac gaa ggc acc tta get aac age gee act tgg gga tac 27869 

Phe Asp Asn Glu Gly Thr Leu Ala Asn Ser Ala Thr Trp Gly Tyr 

2230 2235 2240 

cga cag ggg caa tct get aac act aat gtg acc aat gee act gaa 27914 

Arg Gin Gly Gin Ser Ala Asn Thr Asn Val Thr Asn Ala Thr Glu 

2245 2250 2255 

ttt atg ccc age tea age .agg tac ccc gtg aat aaa gga gac aac 27959 

Phe Met Pro Ser Ser Ser Arg Tyr Pro Val Asn Lys Gly Asp Asn 

2260 2265 2270 
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28094 



28136 



att caa aat caa tct ttt tea tac acc tgt att aaa gga gat ttt 28004 
lie Gin Asn Gin Ser Phe Ser Tyr Thr Cys He Lys Gly Asp Phe 
2275 2280 2285 

get atg cct gtc ccg ttc cgt gta aca tat aat cac gcc ctg gaa 28049 
Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His Ala Leu Glu 
2290 2295 2300 

ggg tat tec ctt aag ttc acc tgg cgc gtt gta gcc aat cag gcc 
Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn Gin Ala 
2305 2310 2315 

ttt gat att cct tgc tgt tea ttt tea tac ate aca gaa taa 
Phe Asp He Pro Cys Cys Ser Phe Ser Tyr He Thr Glu 
2320 2325 2330 

aaaaccactt tttcatttta atttcttttt attttacacg aacagtgaga cttcctccac 28196 
ccttccattt gacagcatac accagcctct cccccttcat agcagtaaac tgttgtgaat 28256 
cagtceggta tttgggagtt aaaatccaaa cagtctcttt ggtgatgaaa cgtcgatcag 28316 
taatggacac aaatccctgg gacaggtttt ccaacgtttc ggtgaaaaac tgcacaccgc 28376 
cctacaaaac aaacaggttc aggctctcca egggttatet ccccgatcaa actcagacag 28436 
ggtaaaggtg cggtggtgtt ccactaaacc acgcaggtgg cgctgtctga acctctcggt 28496 
gcgactcctg tgaggctggt aagaagttag attgtccagt agcctcacag catgtatcat 28556 
cagtctacga gtgcgtctgg cgcagcagcg catctgaatc tcactgagat teeggcaaga 28616 
atcgcacacc atcacaatca ggttgttcat gatcecatag ctgaacacgc tccagccaaa 28676 
gctcattcgc tccaacagcg ccaccgcgtg tccgtccaac cttactttaa cataaatcag 28736 
gtgtctgccg cgtacaaaca tgctacccac atacagaact tcccggggca ggcccctgtt 28796 
caccacctgt ctgtaccagg gaaacctcac atttatcagg gagecataga tggccatttt 28856 
aaaccaatta gctaataccg ccccaccagc tctacactga agagaacegg gagagttaca 28916 
atgacagtga ataatccatc tctcataacc cctgatggtc tgatgaaaat ctagatctaa 28976 
cgtggcacaa caaatacaca ctttcatata cattttcata acatgttttt cccaggccgt 29036 
taaaatacaa tcccaataca cgggccactc ctgeagtaca ataaagctaa tacaagatgg 29096 
tatactcctc acctcactga cactgtgcat gttcatattt tcacattcta agtaccgaga 29156 
gttctcctct acagcagcac tgctgcggtc ctcacaaggt ggtagctggt gatgattgta 29216 
gggggccagt ctgeagegat accgtctgtc gcgttgcatc gtagaccagg aaccgacgca 29276 
cctcctcgta cttgtggtag cagaaccacg tccgctgcca gcacgtctcc acgtaacgcc 29336 
ggtccctgcg tcgctcacgc tccctcctca atgcaaagtg caaccactct tgtaatccac 29396 
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acagatccct ctcggcctcc ggggtgatgc acacctcaaa cctacagatg tctcggtaca 29456 

gttccaaaca cgtagtgagg gcgagttcca accaagacag acagcctgat ctatcccgac 29516 

acactggagg tggaggaaga cacggaagag gcatgttatt ccaagcgatt caccaacggg 29576 

tcgaaatgaa gatcccgaag atgacaacgg tcgcctccgg agccctgatg gaatttaaca 29636 

gccagatcaa acgttatgcg attctccaag ctatcgatcg ccgcttccaa aagagcctgg 29696 

acccgcactt ccacaaacac cagcaaagca aaagcactat tatcaaactc ttcaatcatc 29756 

aagctgcagg actgtacaat gcctaagtaa ttttcgtttc tccactcgcg aatgatgtcg 29816 

cggcagatag tctgaaggtt catcccgtgc agggtaaaaa gctccgaaag ggcgccctct 29876 

acagccatgc gtagacacac catcatgact gcaagatatc gggctcctga gacacctgca 29936 

gcagatttaa cagatcaagg tcaggttgct ctccgcgatc acgaatctcc atccgcaagg 29996 

tcatttgcaa aaaattaaat aaatctatgc cgactagatc tgtcaactcc gcattaggaa 30056 

ccaaatcagg tgtggctacg cagcacaaaa gttccaggga tggtgccaaa ctcactagaa 30116 

ccgctcccga gtaacaaaac tgatgaatgg gagtaacaca gtgtaaaatg tgcaaccaaa 30176 

aatcactaag gtgctccttt aaaaagtcca gtacttctat attcagtccg tgcaagtact 30236 

gaagcaactg tgcgggaata tgcacaacaa aaaaaatagg gcggctcaga tacatgttga 30296 

cctaaaataa aaagaatcat taaactaaag aagcttggcg aacggtggga taaatgacac 30356 

gctccagcag cagacaggca accggctgtc cccgggaacc gcggtaaaat tcatccgaat 30416 

gattaaaaag aacaacagaa acttcccacc atgtactcgg ttggatctcc tgagcacaca 30476 

gcaatacccc cctcacattc atgtccgcca cagaaaaaaa acgtcccaga tacccagcgg 30536 

ggatatccaa cgacagctgc aaagacagca aaacaatccc tctgggagcg atcacaaaat 30596 

cctccggtga aaaaagcaca tacatattag aataaccctg ttgctggggc aaaaaggccc 30656 

ggcgtcccag caaatgcaca taa^tatgtt catcagccat tgccccgtct taccgcgtaa 30716 

tcagccacga aaaaatcgag ctaaaattca cccaacagcc tatagctata tatacactcc 30776 

gcccaatgac gctaataccg caccacccac gaccaaagtt cacccacacc cacaaaaccc 30836 

gcgaaaatcc agcgccgtca gcacttccgc aatttcagtc tcacaacgtc acttccgcgc 30896 

gccttttcac attcccacac acacccgcgc ccttcgcccc gccctcgcgc caccccgcgt 30956 

caccgcacgt caccccggcc ccgcctcgct cctccccgct cattatcata ttggcacgtt 31016 

tccagaataa ggtatattat tgatgatg 31044 
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<210> 30 

<211> 505 

<212> PRT 

<213> simian adenovirus SV-25 

<400> 30 

Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 
! 5 * 10 15 

Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 
20 25 30 

Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 
35 * 40 45 

Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 
50 55 60 

Val Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Ser Leu Asn Tyr 
65 70 75 80 

Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn Asp 
85 90 95 

Phe Thr Pro Thr Glu Ala Gly Thr Gin Thr He Asn Phe Asp Glu Arg 
100 105 HO 

Ser Arg Trp Gly Gly Gin Leu Lys Thr He Leu His Thr Asn Met Pro 
115 120 125 

Asn He Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met 
130 135 140 

Val Glu Lys Ser Asn Ala Glu Thr Arg Gin Pro Arg Tyr Glu Trp Phe 
145 150 155 160 

Glu Phe Thr He Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr He Asp 
165 170 175 

Leu Met Asn Asn Ala He Val Asp Asn Tyr Leu Gin Val Gly Arg Gin 
180 185 190 

Asn Gly Val Leu Glu Ser Asp He Gly Val Lys Phe Asp Thr Arg Asn 
195 200 205 

Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val 
210 " 215 220 

Tyr Thr Asn Glu Ala Phe His Pro Asp He Val Leu Leu Pro Gly Cys 
225 230 235 240 

Gly Val Asp Phe Thr Gin Ser Arg Leu Ser Asn Leu Leu Gly He Arg 
245 250 255 
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Lys Arg Arg Pro Phe Gin Glu Gly Phe Gin He Met Tyr Glu Asp Leu 
260 265 270 

Glu Gly Gly Asn He Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala 
275 280 285 

Ser He Gin Arg Ala Lys Ala Glu Gly Arg Glu He Arg Gly Asp Thr 
290 295 300 

Phe Ala Val Ala Pro Gin Asp Leu Glu He Val Pro Leu Thr Lys Asp 
305 310 315 320 

Ser Lys Asp Arg Ser Tyr Asn He He Asn Asn Thr Thr Asp Thr Leu 
325 330 335 

Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 
340 345 350 

Val Arg Ser Trp Thr He Leu Thr Thr Thr Asp Val Thr Cys Gly Ser 
355 360 365 

Gin Gin Val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro Val Thr 
370 375 380 

Phe Arg Pro Ser Thr Gin Val Ser Asn Phe Pro Val Val Gly Thr Glu 
385 390 395 400 

Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 
405 410 415 

Ser Gin Leu He Arg Gin Ser Thr Ala Leu Thr His Val Phe Asn Arg 
420 425 430 

Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr 
435 440 445 

Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 
450 455 460 

Leu Arg Ser Ser He Ser Gly Val Gin Arg Val Thr He Thr Asp Ala 
465 " 470 475 480 

Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala 
485 490 495 

Pro Lys Val Leu Ser Ser Arg Thr Phe 
500 505 

<210> 31 
<211> 921 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 31 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala 
1 5 10 15 
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Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 HO 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Pro Ser Glu Trp Thr Asp Thr Ser Asp Asn Lys Leu Lys 
130 135 140 

Ala Tyr Ala Gin Ala Pro Tyr Gin Ser Gin Gly Leu Thr Lys Asp Gly 
145 150 155 160 

He Gin Val Gly Leu Val Val Thr Glu Ser Gly Gin Thr Pro Gin Tyr 
165 170 175 

Ala Asn Lys Val Tyr Gin Pro Glu Pro Gin He Gly Glu Asn Gin Trp 
180 185 190 

Asn Leu Glu Gin Glu Asp Lys Ala Ala Gly Arg Val Leu Lys Lys Asp 
195 200 205 

Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu 
210 215 220 

Gin Gly Gly Gin Ala Lys Asn Gin Glu Val Asp Leu Gin Phe Phe Ala 
225 230 235 240 

Thr Pro Gly Asp Thr Gin Asn Thr Ala Lys Val Val Leu Tyr Ala Glu 
245 250 255 

Asn Val Asn Leu Glu Thr Pro Asp Thr His Leu Val Phe Lys Pro Asp 
260 265 270 

Asp Asp Ser Thr Ser Ser Lys Leu Leu Leu Gly Gin Gin Ala Ala Pro 
275 280 285 

Asn Arg Pro Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met 
290 295 300 
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Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 
305 310 315 320 

Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 
325 330 335 

Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser 
340 345 350 

Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He 
355 360 365 

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 
370 * 375 380 

Gly Gly Met Val Val Thr Asp Asn Tyr Asn Ser Val Thr Pro Gin Asn 
385 390 395 400 

Gly Gly Ser Gly Asn Thr Trp Gin Ala Asp Asn Thr Thr Phe Ser Gin 
405 410 415 

Arg Gly Ala Gin He Gly Ser Gly Asn Met Phe Ala Leu Glu He Asn 
420 425 430 

Leu Gin Ala Asn Leu Trp Arg Gly Phe Leu Tyr Ser Asn He Gly Leu 
435 440 445 

Tyr Leu Pro Asp Ser Leu Lys He Thr Pro Asp Asn He Thr Leu Pro 
450 455 460 

Glu Asn Lys Asn Thr Tyr Gin Tyr Met Asn Gly Arg Val Thr Pro Pro 
465 470 475 480 

Gly Leu He Asp Thr Tyr V^l Asn Val Gly Ala Arg Trp Ser Pro Asp 
485 490 495 

Val Met Asp Ser He Asn Pro Phe Asn His His Arg Aen Ala Gly Leu 
500 505 510 

Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 
515 520 525 

He Gin Val Pro Gin Lys Phe Phe Ma He Lys Asn Leu Leu Leu Leu 
530 535 540 

Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met 
545 550 555 560 

He Leu Gin Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser 
565 570 575 

He Arg Phe Asp Ser He Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala 
580 585 590 

His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 
595 600 605 



179 



WO 03/0461 24 PCTYUS02/33645 



Asp Gin Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu. Tyr Pro 
610 615 620 

lie Pro Ala Asn Ala Thr Ser Val Pro He Ser He Pro Ser Arg Asn 
625 630 635 640 

Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu 
645 650 655 

Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly 
660 665 670 

Ser He Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys 
675 680 685 

Lys Val Ser He Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp 
690 695 700 

Arg Leu Leu Thr Pro Asn Glu Phe Glu He Lys Arg Ser Val Asp Gly 
705 710 715 720 

Glu Gly Tyr Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu 
725 730 735 

He Gin Met Leu Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val 
740 745 750 

Pro Glu Asn Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin 
755 * 760 765 

Pro Met Ser Arg Gin Val Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys 
770 ~ 775 780 

Asp Val Lys Leu Pro Tyr Gin His Asn Asn Ser Gly Phe Val Gly Tyr 
785 J 790 795 800 

Met Gly Pro Thr Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro 
805 810 815 

Tyr Pro Leu He Gly Glu Thr Ala Val Pro Ser Leu Thr Gin Lys Lys 
820 825 830 

Phe Leu Cys Asp Arg Val Met Trp Arg He Pro Phe Ser Ser Asn Phe 
835 " 840 845 

Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 
850 " 855 860 

Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp 
865 870 875 880 

Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg 
885 890 895 

He His Gin Pro His Arg Gly Val He Glu Ala Val Tyr Leu Arg Thr 
900 905 910 



180 



WO 03/046124 



PCT/US02/33645 



Pro Phe Ser Ala Gly Asn Ala Thr Thr 
915 920 



<210> 32 

<211> 347 

<212> PRT 

<213> simian adenovirus SV-25 

<400> 32 

Met Lys He Cys Val Val He Phe Ala Leu Ser Leu He Lys Thr Glu 
1 5 10 15 

Leu Phe Ala Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys 
20 25 30 

Thr Pro Gly Val Leu Asn Leu His He Ala Lys Pro Leu Tyr Phe Thr 
35 40 45 

He Glu Lys Gin Leu Ala Leu Ser He Gly Lys Gly Leu Thr He Ser 
50 ' 55 60 

Ala Thr Gly Gin Leu Glu Ser Thr Ala Ser Val Gin Asp Ser Ala Thr 
€5 70 75 80 

Pro Pro Leu Arg Gly He Ser Pro Leu Lys Leu Thr Asp Asn Gly Leu 
85 90 95 

Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp Gin Leu Thr 
100 105 HO 

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly Ser Leu Thr Phe 
115 120 125 

Asn Tyr Thr Ser Pro Met Thr Leu He Asn Asn Ser Leu Ala He Asn 
130 135 140 

Val Asn Thr Ser Lys Gly Leu Ser Ser Asp Asn Gly Thr Leu Ala Val 
145 150 ' 155 160 

Asn Val Thr Pro Asp Phe Arg Phe Asn Ser Ser Gly Ala Leu Thr Phe 
165 170 175 

Gly He Gin Ser Leu Trp Thr Phe Pro Thr Lys Thr Pro Asn Cys Thr 
180 185 190 

Val Phe Thr Glu Ser Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 
195 200 205 

Gly Ala His Val Leu Gly Ser Val Ser Leu Ser Gly Val Ala Gly Thr 
210 215 220 

Met Leu Lys Met Thr His Thr Ser Val Thr Val Gin Phe Ser Phe Asp 
225 * 230 235 240 
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Asp Ser Gly Lys Leu He Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly 
245 250 255 

Val Arg Gin Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu 
260 . 265 270 

Thr Phe Met Pro Asn Ser Thr He Tyr Ser Arg Gly Ala Ser Asn Glu 
275 280 285 

Pro Gin Asn Asn Tyr Tyr Val Gin Thr Tyr Leu Arg Gly Asn Val Arg 
290 " 295 300 

Lys Pro He Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly Tyr 
305 310 315 320 

Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe Ala Thr 
325 330 335 

Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gin 
340 345 



<210> 33 
<211> 559 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 33 

Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Asp Pro Pro Tyr Ala Pro Val Met Pro Phe He Thr Pro Pro Phe Thr 
20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 
35 40 45 

Tyr Arg Asp Pro He Thr Thr Gin Asn Gly Ser Leu Thr Leu Lys Leu 
50 55 60 

Glv Asn Gly Leu Thr Leu Asn Asn Gin Gly Gin Leu Thr Ser Thr Ala 
65 70 75 80 

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
85 90 95 

Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu Ser 
100 105 HO 

His Thr Ala Pro Leu Val He Ala Asn Asn Ser Leu Ala Leu Gin Val 
115 120 125 

Ser Glu Pro He Phe Val Asn Asp Asp Asp Lys Leu Ala Leu Gin Thr 
130 135 140 
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Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gin Ser Ala 
145 150 155 160 

Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys Leu Leu Phe Ser Lys 
165 170 175 

Pro Leu Tyr Leu Gin Asn Asp Phe Leu Ala Leu Ala lie Glu Arg Pro 
180 185 190 

Leu Ala Val Ala Ala Ala Gly Thr Leu Thr Leu Gin Leu Thr Pro Pro 
195 200 205 

Leu Lys Thr Asn Asp Asp Gly Leu Thr Leu Ser Thr Val Glu Pro Leu 
210 215 220 

Thr Val Lys Asn Gly Asn Leu Gly Leu Gin He Ser Arg Pro Leu Val 
225 * 230 235 240 

Val Gin Asn Asn Gly Leu Ser Leu Ala He Thr Pro Pro Leu Arg Leu 
245 250 255 

Phe Asn Ser Asp Pro Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala 
'260 265 270 

Val Thr Asn Asn Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu 
275 280 285 

Thr Tyr Asn Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gin Phe Glu 
290 295 300 

Asn Gly Ala He Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gin Tyr 
305 ~ 310 315 320 

Thr Asn Lys Leu Gin Leu Asn He Gly Ala Gly Leu Arg Tyr Asn Gly 
325 330 335 

Ala Ser Arg Lys Leu Asp Val Asn He Asn Gin Asn Lys Gly Leu Thr 
340 345 350 

Trp Asp Asn Asp Ala Val He Pro Lys Leu Gly Ser Gly Leu Gin Phe 
355 ~ 360 365 

Asp Pro Asn Gly Asn He Ala Val He Pro Glu Thr Val Lys Pro Gin 
370 * 375 380 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Val Tyr Gin 
385 390 • 395 400 

Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val Lys Ser Gly Asp Met 
405 410 415 

Val His Gly Ser He Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Asn 
420 425 430 

Pro Thr Ala Ser Tyr He Ser He Val He Tyr Phe Tyr Ser Asn Gly 
435 440 445 
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Val Arg Arg Thr Asn Tyr Pro Thr Phe Asp Asn Glu Gly Thr Leu Ala 
450 455 460 

Asn Ser Ala Thr Trp Gly Tyr Arg Gin Gly Gin Ser ALa Asn Thr Asn 
465 470 475 480 

Val Thr Asn Ala Thr Glu Phe Met Pro Ser Ser Ser Arg Tyr Pro Val 
485 490 495 

Asn Lys Gly Asp Asn lie Gin Asn Gin Ser Phe Ser Tyr Thr Cys lie 
500 505 510 

Lys Gly Asp Phe Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His 
515 520 525 

Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn 
530 " 535 540 

Gin Ala Phe Asp lie Pro Cys Cys Ser Phe Ser Tyr He Thr Glu 
545 550 555 



<210> 34 

<211> 34115 

<212> DNA 

<213> simian adenovirus SV-39 
<220> 

<221> CDS 

<222> (13448) (14959) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (17785) .. (20538) 

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (29515) (31116) 

<223> L5 Fiber #1 



<400> 34 

catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 
gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgggtttcg gggaggagcg 
cggggcgggg cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 180 
ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 240 
aaaagtgaaa aatgccgagg tagtcaccgg gtggagatct gacctttgcc gtgtggagtt 300 
tacccgctga cgtgtgggtt tcggtctcta ttttttcact gtggttttcc gggtacggtc 360 
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aaaggtcccc 
accgtcaaga 
actgtgaaaa 
agctgttgga 
tttgtgaaga 
aggatgaatg 
atttgccacc 
catcaggtga 
acagcgggat 
aagatgatga 
gtaagtcctg 
gttatctccg 
tggctttgtt 
tttcagatgt 
cgaaactaac 
cccggaaacg 
tggacttgtc 
agcttccggg 
tacgggcggg 
atcgagtgag 
gtttgctgga 
ctctgagtcg 
tagagcagct 
agaggctttt 
ttgctgtaca 
tggacttcct 
ggcgcttggt 
tgaggcggga 
cgagagcggg 



attttatgac 
ggccactctt 
aatgaggaac 
ggacattgtt 
ttttagtctt 
gctggagtct 
acctcacaac 
acttgacttg 
cagcgatccc 
tggcgatgat 
tgaataccac 
tcttaacgct 
gagtttattt 
ggaggaacct 
tcagcgctgc 
tgctgctgtt 
cttaaagcga 
tgactcagtg 
aagggattag 
aagttccatg 
ggaggcctcc 
ctttttgcac 
gcctggactg 
tccacaattg 
tctgttggac 
gacgctatgc 
ggagaggcat 
ggatctggaa 
gctggaccct 



tccacgtcag ctgatcgcta 
gagtgccggc gagaagagtt 
ttcttgctat ctccggggct 
accggagctc tgggagacga 
catgatctct atgatattga 
gtggatgggt tttttccgga 
tctcacactg agcccgagtc 
gcttgttacg agactatgcc 
acggctttta tggtctctaa 
ggatttcgac tggacgctcc 
cgggatcgta ccggagaccc 
gcttttgtct acagtaagtg 
ttttctgtgt ctcatagggt 
gatagtacta ctggaaatga 
agacctaata ttttgagacc 
aattgcatag aagatttatt 
ccccgcccgc agtagggcgc 
caataaaaat tttcttggca 
gggagtataa agctggaggg 
gacttgtacg agagcctaga 
gacagaacct cttacatttg 
cgggtgaagc gagagcacct 
tttgattctt tgaatctcgg 
gacttttcct ctccaggccg 
agatggaacg agcagacgca 
ctatggaagt tcggaatcag 
ccgtctctgc gccagcagcg 
gccatttcgg aggaggagag 
ccg.gcggagg agtagggggg 



gggtatttaa 
ttctcctccg 
gccagcgacc 
tcctcaggtg 
tccgggtgtt 
cgctatgctg 
agctgctatt 
tccggagtcg 
ggcgattgct 
ggcggtgccg 
gtctatgttg 
ttttgtgctt 
gttgtttatt 
ggaggaaaag 
ctcggcccag 
ggaagagccc 
ggtgccagtt 
acaggtgtat 
gaaaaatctg 
gaatctaagt 
gaggtttctg 
gacggaattt 
ccaccggacg 
tctgtgttca 
gctcagcccg 
gagggggagg 
tctgcaagct 
cggcatggaa 
ataccggacc 



tgcgcctcag 
cgttccgcca 
gtagccgccg 
atttctcact 
gaggggcaag 
ctagaggctg 
cctgaattgt 
gatgaggagg 
atactaaaag 
gggagagact 
tgttctctgt 
ttttaccctg 
ataggtcctg 
ccctccccgc 
cgtgtgtcat 
actgaacctt 
ttttctctct 
gtgtttactt 
aggctgtcag 
tctttgcgac 
ttcggttccc 
gatgggcttt 
ctgctagagg 
gcgcttgctt 
ggttacactc 
aagctgtacg 
caagtgctgc 
gagaagaatc 
cttttcctga 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



185 



WO 03/046124 PCT/US02/33645 

gttggctttg ggggcggtgg ggggcgcttc tgtggtacgt gaggatgaag aggggcgcca 2160 

acgcggtcag aagagggagc attttgagtc ctcgactttc ttggctgatg taaccgtggc 2220 

cctgatggcg aaaaacaggc tggaggtggt gtggtacccg gaagtatggg aggactttga 2280 

gaagggggac ttgcacctgc tggaaaaata taactttgag caggtgaaaa catactggat 2340 

gaacccggat gaggactggg aggtggtttt gaaccgatac ggcaaggtag ctctgcgtcc 2400 

cgactgtcgc taccaggttc gcgacaaggt ggtcctgcga cgcaacgtgt acctgttggg 2460 

caacggcgcc accgtggaga tggtggaccc cagaaggggt ggttttgtgg ccaatatgca 2520 

agaaatgtgc cctggggtgg tgggcttgtc tggggtgact tttcatagtg tgaggtttag 2580 

cggtagcaat tttgggggtg tggttattac cgcgaacact cctgtggtcc tgcataattg 2640 

ctactttttt ggcttcagca acacctgtgt ggaaatgagg gtgggaggca aagtgcgcgg 2700 

gtgttccttt tacgcttgct ggaagggggt ggtgagccag ggtaaggcta aagtgtctgt 2760 

tcacaagtgt atgttggaga gatgcacctt gggcatttcc agtgagggct tcctccacgc 2820 

cagcgacaac gtggcttctg acaacggctg cgcctttctt atcaagggag ggggtcgcat 2880 

ctgtcacaac atgatatgcg gccctgggga tgtcccccca aagccttacc agatggttac 2940 

ctgcacagat ggcaaggtgc gcatgctcaa gcctgtgcac attgtgggcc accggcgcca 3000 

ccgctggcca gagtttgaac acaatgtgat gacccgctgt agcttgtacc tgggaggcag 3060 

gcgaggagtt ttcttgccca gacagtgtaa cctggcccac tgcaacgtga tcatggaaca 3120 

atccgccgct acccaggttt gctttggagg aatatttgat ataagcatgg tggtgtataa 3180 

gatcctgcgc tacgacgact gtcgggctcg tactcgaacc tgcgactgcg gagcctctca 3240 

cctgtgtaac ctgactgtga tggggatggt gactgaggag gtgcgactgg accactgtca 3300 

gcactcttgc ctgcgggagg agttttcttc ctcggacgag gaggactagg taggtggttg 3360 

gggogtggcc agcgagaggg tgggctataa aggggaggtg tcggctgacg ctgtcttctg 3420 

tttttcaggt accatgagcg gatcaagcag ccagaccgcg ctgagcttcg acggggccgt 3480 

gtacagcccc tttctgacgg ggcgcttgcc tgcctgggcc ggagtgcgtc agaatgttac 3540 

cggttcgacc gtggacggac gtcccgtgga tccatctaac gctgcttcta tgcgctacgc 3600 

tactatcagc acatctactc tggacagcgc cgctgccgcc gcagccgcca cctcagccgc 3660 

tctctccgcc gccaagatca tggctattaa cccaagcctt tacagccctg tatccgtgga 3720 

cacctcagcc ctggagcttt accggcgaga tctagctcaa gtggtggacc aactcgcagc 3780 
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cgtgagccaa 
gtaaccgcaa 
tgtggttgac 
gtccatcgtt 
gtctgaacgt 
agggcctcgt 
tggtgctgga 
gtgttgacaa 
gcctggatct 
aagaccacca 
aaagcgtgga 
atgatgatgg 
acatcgtagt 
agggtcccag 
atttgcattt 
aaaaaaatgg 
tgtgacttgc 
tttagagagc 
ctaacatgta 
agttcaggaa 
ctgagagttt 
tctcgatcca 
ggcggtgggc 
gggtggtctc 
tgaggctgag 
agcaacggac 
tgcctttgga 
gcgccaagaa 
cgcattccac 



cagttgcagc 
aaattcaata 
tttattcctg 
cccggtcgtt 
tgagatacat 
tttcaggggt 
agatgtcctt 
agcggttgag 
tgaggttggc 
gcaccgagta 
agaatttgga 
cgatgggccc 
tgtggtcctg 
actggggaat 
cccaggcttt 
tctctggggc 
cgcagccggt 
tacaactgcc 
agttttcccg 
gcgaagcaaa 
gactgagcag 
gcagacctcc 
gtccagctgg 
ggtcacggtg 
gcggctggtg 
catgagctca 
aacgtgcccg 
cacggattcc 
cagccaggtc 



tggtgtcgac 
aacagaattt 
gatagctggg 
gagaacacgg 
gggcatgagc 
ggtgttgtaa 
cagcagcaag 
ttgggagggg 
tatgttgccg 
accggtgcag 
gacccctcgg 
ccgggaggca 
ggtgagttca 
gatggttcca 
gatttcagag 

gggggtgatg 

gggcccgtag 
atcatccttc 
cactaagtcc 
gtttttaagt 
ttccagccgg 
tcgtttcggg 
gccatggtgc 
aatgggtggg 
gcgaaccgtt 
tagtccaggc 
cagtttgagc 
ggggaataag 
agctgaggat 



ccgagtggag 
aataaacagc 
gggagggaac 
tggatttttt 
ccgtctcggg 
atgatccagt 
ctgatggcaa 
tgcatgcggg 
cccagatcgc 
cgggggaatt 
tgcccgccta 
gcctgggcaa 
tcataggaca 
tccggtccgg 
ggagggatca 
agctgggtgg 
atgacagcta 
aaaagcgggg 
tgcaggagac 
ggcttgaggc 
tcccagagct 
ggttggggcg 
ggtccctcca 
ccccgggctg 
gcttttcgtc 
tctctgcggc 
agagcaagca 
catccccacc 
cttttgggtc 



caactttccc 
acttgagaaa 
ggcgggaacg 
ccaagacccg 
ggtggaggta 
cgtaggcccc 
cgggaagacc 
gactgatgag 
gcctgggatt 
tgtcgtgcag 
ggttttccat 
aaacgttgcg 
ttttgacaaa 
gggcgtagtt 
tgtcaacctg 
aaagcaggtt 
tgacgggttg 
ccacactgtt 
gtgaccctcc 
catcggccaa 
cggttacgtg 
gctctggctg 
tgggcgcagg 
ggcgctggcc 
tccctgcaag 
atgtcctttg 
ttttagcgcg 
gcagttggag 
aaaaaccaag 



gccctcccca 
agtttaaact 
gtaagacctg 
atagaggtgg 
ggcccactgc 
ccgctgggcg 
cttggtgtag 
gtgcattttg 
catgttatgc 
cttggaaggg 
gcactcatcc 
ggggtccgtg 
gcgcgggcag 
gccctcgcag 
gggggcgatg 
gcgcaagagc 
cagggtgtag 
taaaagttct 
tagggagaga 
gggcaagttc 
ctctacggca 
tagggaatga 
gttctcttca 
agggtgcgct 
tcagccaaat 
gcgcgaagct 
tagagttttg 
caaacggttt 
cgcccgccgt 
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tttttttgat 
agaggctgtc 
ggtcctcggc 
cgaatgaggc 
aggtgtgcag 
aagtcacgtg 
cactctcttc 
aggcgggcat 
cgagcgcccc 
tctgtcggtt 
cgatggagcg 
gctggacgta 
gcaccagcct 
cctctccgcg 
ggggcagggg 
gcagcaggtg 
agtcccgcac 
gcgtcagggc 
cgccgaggta 
gctcgtgcga 
ggaagaggat 
tgttgaagcg 
gcagcttttc 
cccgcacgat 
actcttcgcg 
agcccaacat 
gcgagtaggc 
ccataacttt 



gcgcttccta 
ggtgtctccg 
gtagagaaac 
gatgtgggaa 
gcacatgtcc 
atcctgtcct 
cggttcgctg 
gacctgtgcg 
cgtggagatg 
atcaagcttg 
ctgggtttgg 
ctccctggcc 
cacgctccat 
caggggctcg 
gtcaagcagg 
tttgtcaaag 
cgccaaggcg 
cgaggcgtac 
ggtggggtag 
gggggccaga 
ctgcctgaag 
ggcgtcgggc 
caccagggag 
gtcataatgc 
gtctttccag 
gtaaaactgg 
cagggcggcc 
tacaaactgg 



cctcgggtct 
tagacggagg 
tcgcaccact 
gggtaccggt 
ccctcgtccg 
tccgcggggg 
tcgccaaagg 
ctgaggttgt 
cccttgaggt 
gtggcaaaag 
tttttttccc 
acgcacttcc 
ccgcggttgt 
ttggtccagc 
cgctcgtccg 
taatcgatct 
cgctcgtatg 
atgccgcaga 
cagcgccccc 
aggtggcggc 
atggcgtggg 
agacccacgg 
gcggtgacca 
tcttcctttt 
tactcttgga 
ttgacggcct 
ttgcgcaggg 
tacttaaagt 



ccatgaggcg 
tcagggcgcg 
ctgacataaa 
cgttatcgat 
cttccaaaaa 
tataaaaggg 
ccagctgttg 
cagtttctat 
gctcggggcc 
acccgtagag 
ggtcggcttt 
agccgggaaa 
gcagggtgat 
agaggcgacc 
gggggtcggc 
gatgcccggg 
gactgagggg 
tgtcatagac 
cgcggatgct 
tgaggtgagc 
agttggagga 
cctcgccgat 
gcacgtccag 
tttccttcca 
gaggaaaccc 
gatagggaca 
aggtgtgagt 
cccggtcgtc 



gtgcccgcgt 
ctcctccagg 
cgcccgggtc 
gagggggtcg 
tgtgattggc 
ggcgtttccc 
gggtacgtaa 
atacgaggaa 
catttggtca 
ggcgttggag 
ttccttggcc 
aacggccgtg 
gacgtcgatg 
gcccttgcga 
gtcgatggta 
gcaacgcagg 
ggcgccccag 
gtaaaggggc 
ggcccgtacg 
gcgctggggc 
gatggtgggc 
aaagtgggcg 
agcgcagtag 
gaggtctcgg 
gttttcgtct 
gcatcccttc 
cagggcaaag 
gcacatgcct 



tcggtgacga 

ggggtcccgc 

caggctagga 
gttttttcca 
ttgtaggtgt 
ccctcctcgt 
acgcgggtga 
gatttgatgg 
gaaaacacaa 
agcaacttgg 
gcgatgttga 
cgctcgtccg 
ctggtggcca 
gagcagaagg 
aagatggcgg 
gcggtttccc 
ggcatgggat 
tcctccagga 
tagtcgtaga 
ttttcatctc 
cgctgaaaaa 
taggactctt 
tccagggttt 
ttgaagagat 
ccacggtaag 
tccacgggca 
gtgtcgcgga 
cgctcccagt 
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7260 
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ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 
agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 
ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 
gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 
gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 
ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560 
cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620 
tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 
gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 
cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 
gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 
ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgtgctgg tttttgtaaa 
agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatcttgc 7980 
gtcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040 
cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100 
ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160 
cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 
gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 
aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtctttatcg 
gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 
cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 
ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 
cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 
ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 
ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700 
atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760 
tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820 
gagttccacg tggcgggcga aaaccggata gtttcggagg cgctgataca gatagttgag 8880 
ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagctc 8940 
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gttgatgtcg cccagcgcct ccaggcgttc catggcctcg taaaagtcca cggcaaagtt 9000 

gaaaaattgg ctgttcctgg ccgagaccgt gagctcttct tccaagagcc gaatgagatc 9060 

cgccacggtg gccctgactt cgcgttcgaa agccccgggt gcctcctcca cctcttcctc 9120 

ctcgacttct tcgaccgctt cgggcacctc ctcttcctcg accaccacct caggcggggc 9180 

tcggcggcgc cggcggcgga cgggcaggcg gtcgacgaaa cgctcgatca tttcccccct 9240 

ccgtcgacgc atggtctcgg tgacggcgcg accctgttcg cgaggacgca gggtgaaggc 9300 

gccgccgccg agcggaggta acagggagat cggggggcgg tcgtggggga gactgacggc 9360 

gctaactatg catctgatca atgtttgcgt agtgacctcg ggtcggagcg agctcagcgc 9420 

ttgaaaatcc acgggatcgg aaaaccgttc caggaacgcg tctag'ccaat cacagtcgca 9480 

aggtaagctg aggaccgtct cgggggcttg tctgttctgt cttcccgcgg tggtgctgct 9540 

gatgaggtag ttgaagtagg cgctcttgag gcggcggatg gtggacagga gaaccacgtc 9600 

tttgcgccca gcttgctgta tccgcaggcg gtcggccatg ccccacactt ctccttgaca 9660 

gcggcggagg tccttgtagt attcttgcat cagcctttcc acgggcacct cgtcttcttc 9720 
ttccgctcgg ccggacgaga gccgcgtcag gccgtacccg cgctgcccct gtggttggag 9780 
cagggccagg tcggccacga cgcgctcggc cagcacggcc tgctggatgc gggtgagggt 9840 
gtcctgaaag tcgtcgagat ccacaaagcg gtggtacgcg ccagtgttga tggtgtaggt 9900 
gcagttgctc atgacggacc agtttacggt ctgggtgcca tggcccacgg tttccaggta 9960 

gcggagacgc gagtaggccc gcgtctcgaa gatgtagtcg ttgcaggtcc gcagcaggta 10020 

ctggtagccc accagcagat gcggcggcgg ctggcggtag aggggccacc gctgggtggc 10080 

gggggcgttg ggggcgagat cttccaacat gaggcggtga tagccgtaga tgtagcgcga 10140 

catccaagtg atgccgctgg ccgtggtgct ggcgcgggcg tagtcgcgaa cgcggttcca 10200 

gatgtttcgc agcggctgga agtactcgat ggtggggcga ctctgccccg tgaggcgggc 10260 

gcagtcggcg atgctctacg gggaaaaaga agggccagtg aacaaccgcc ttccgtagcc 10320 

ggaggagaac gcaagggggt caaagaccac cgaggctcgg gttcgaaacc cgggtggcgg 10380 

cccgaatacg gagggcggtt ttttgctttt ttctcagatg catcccgtgc tgcggcagat 10440 

gcgtccgaac gcggggtccc agtccccggc ggtgcctgcg gccgtgacgg cggcttctac 10500 

ggccacgtcg cgctccaccc cgcctaccac ggcccaggcg gcggtggctc tgcgcggcgc 10560 

aggggaaccc gaagcagagg cggtgttgga cgtggaggag ggccaggggt tggctcggct 10620 
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gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680 

gtacgtgccg cggagcaatc tgtttcgcga ccgcagcggc gaggaggccg aggagatgcg 10740 

agacttgcgt tttcgggcgg ggagggagtt gcgtcacggg ctggaccggc agagggttct 10800 

gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860 

ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920 

gaagagcttc aatcatcacg tgcgcacgct gattgcgcgc gaagaggtgg ccatcggcct 10980 

catgcatctg tgggattttg tggaggcgta cgttcagaac cccagcagca agccgctgac 11040 

ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca gggaggccat 11100 

gctgaacatt gcagagcctg aggggcgctg gctcttggat ctcattaaca tcttgcagag 11160 

tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaactacag 11220 

catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgt tcgtccccat 11280 

agacaaggag gtgaagatag acagctttta catgcgcatg gcgctcaagg tgctgactct 11340 

aagcgacgac ctgggggtgt accgcaacga ccgcatacac aaggcggtga gcgccagccg 11400 

ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460 

cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520 

ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580 

ggacgacgag ttgggcattg actt'gtagcc gtttttcgtt agatatgtcg gcgaacgagc 11640 

cgtctgcggc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700 

cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760 

gcatcctggc gctgacggcg cgcaaccccg aggcttttcg gcagcagccg caggcaaacc 11820 

ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880 

tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940 

aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000 

atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060 

aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120 

acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180 

ccgaggtgcc gcagagcgag gtgtaccagt ctggccccaa ctactacttc cagacctccc 12240 

ggcagggcct gcagacggta aacctgacgc aggcctttca gaacctgcag ggcctttggg 12300 

gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360 
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ggctgctctt gcttctcatt gctccgttca ccgacagcgg ttccatcagc cgcgactctt 12420 

acctgggaca cctgctcacc ctgtaccggg aggccatcgg gcaggcgcgg gtggacgagc 12480 

agacgtacca ggaaatcacc agcgtgagcc gcgcgctggg gcaggaggac acgggcagct 12540 

tggaggcgac tctgaacttc ctgctgacca accggcggca gcgcctacct ccccagtacg 12600 

cgctgaacgc ggaggaggag cgcatcctgc gtttcgtgca gcagagcacc gcgctgtact 12660 

tgatgcggga aggcgcctct cccagcgctt cgctggacat gacggcggcc aacatggagc 12720 

catcgttcta cgccgccaac cgtcccttcg tcaaccggct aatggactat ttgcatcggg 12780 

cggcggccct gaacccggaa tactttacta acgtcatcct gaacgaccgt tggctgocac 12840 

ctcccggctt ctacacgggg gagttcgacc tcccggaggc caacgacggt ttcatgtggg 12900 

acgacgtgga cagcgtgttc ctgcccggca agaaggaggc gggtgactct cagagccacc 12960 

gcgcgagcct cgcagacctg ggggcgaccg ggcccgcgtc tccgctgcct cgcctgccga 13020 

gcgccagcag cgccagcgtg gggcgggtga gccgtccgcg cctcagcggt gaggaggact 13080 

ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140 

aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200 

aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260 

acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320 

ttgcccacct gcaacctcgc ggcgtgggtc ggcggtggcg ctaggaaaaa aaattattaa 13380 

aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440 

tagcaaa atg cgt egg gcg gtg gca gtt ccc tec gcg gca atg gcg tta 13489 
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu 
15 10 

ggc ccg ccc cct tct tac gaa age gtg atg gca gcg gec acc ctg caa 
Glv Pro Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin 
15 20 . 25 30 

gcg ccg ttg gag aat cct tac gtg ccg ccg cga tac ctg gag cct acg 
Ala Pro Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr 
35 40 45 

ggc ggg aga aac age att cgt tac teg gag ctg acg ccc ctg tac gac 
Gly Gly Arg Asn Ser He Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp 
50 55 60 

acc acc cgc ctg tac ctg -gtg gac aac aag tea gca gat ate gec acc 
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Thr 
65 "70 75 



13537 
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13681 



192 



WO 03/046124 



PCT/US02/33645 



ttg aac tac cag aac gac cac age aac ttt etc acg tec gtg gtg cag 
Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin 
80 85 90 

aac age gac tac acg ccc gee gaa gcg age acg cag acc att aac ttg 
Asn Ser Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr He Asn Leu 
95 100 105 HO 

gac gac cgc teg cgc tgg ggc ggg gac ttg aaa acc att ctg cac act 
Asp Asp Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr He Leu His Thr 
115 120 125 

aac atg ccc aac gtg aac gag ttc atg ttt acc aac teg ttc agg get 
Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala 
130 135 140 



tgc ggg gtg gac ttt acc tac age egg tta aac aac ctg eta ggc ata 
Cys Gly Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly He 
240 245 250 

cgc aag aga atg ccc ttt cag gaa ggg ttt cag ate ctg tac gag gac 
Arg Lys Arg Met Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp 
255 260 265 270 



13729 



13777 



13825 



13873 



13969 



14017 



aaa ctt atg gtg gcg cac gag gee gac aag gac ccg gtt tat gag tgg 13921 
Lys Leu Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp 
145 150 155 

gtg cag ctg acg ctg ccg gag ggg aac ttt tea gag att atg acc ata 
Val Gin Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu He Met Thr He 
160 165 170 

gac ctg atg aac aac gee att ate gac cac tac ctg gcg gta gec aga 
Asp Leu Met Asn Asn Ala He He Asp His Tyr Leu Ala Val Ala Arg 
175 180 185 190 

cag cag ggg gtg aaa gaa age gag ate ggc gtc aag ttt gac acg cgc 14065 
Gin Gin Gly Val Lys Glu Ser Glu He Gly Val Lys Phe Asp Thr Arg 
195 200 205 

aac ttt cgt ctg ggc tgg gac ccg gag acg ggg ctt gtg atg ccg ggg 14113 
Asn Phe Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly 
210 215 220 

gtg tac acg aac gaa get ttc cat ccc gac gtg gtc etc ttg ccg ggc 14161 
Val Tyr Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly 
225 230 235 



14209 



14257 



ctg gag ggc ggt aac ate ccg gee ctg ctg gac gtg ccg gcg tac gag 14305 
Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu 
275 280 285 

gag age ate gee aac gca agg gag gcg gcg ate agg ggc gat aat ttc 14353 
Glu Ser He Ala Asn Ala Arg Glu Ala Ala He Arg Gly Asp Asn Phe 
290 295 300 
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gcg gcg cag ccc cag gcg get cca acc ata aaa ccc gtt ttg gaa gac 14401 
Ala Ala Gin Pro Gin Ala Ala Pro Thr lie Lys Pro Val Leu Glu Asp 



305 



310 315 



ctt atg ccg att cac age aag age ttt tac aac gag cag gec gtc tac 
Leu Met Pro He His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr 
400 405 410 

tea cag etc att cgt cag aco acc gee eta acg cac gtt ttc aac cgc 
Ser Gin Leu He Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg 
415 420 425 430 

ttc ccc gag aac caa ate eta gtg cga cct cca gcg ccc acc ate acc 
Phe Pro Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr 
435 440 445 

acc gtc age gag aac gtg ccc get eta acc gat cac ggg acg ctg cct 
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 
450 455 460 

ttg cag aac age ate cgc gga gtt cag cga gtt acc ate acg gac gee 
Leu Gin Asn Ser He Arg Gly Val Gin Arg Val Thr He Thr Asp Ala 

m M A *7 C 



465 



470 475 



tec aaa ggg egg age tac aac gta ata gec aac acc aac aac acg get 14449 
Ser Lys Gly Arg Ser Tyr Asn Val He Ala Asn Thr Asn Asn Thr Ala 
320 325 330 

tac agg age tgg tat ctg get tat aac tac ggc gac ccg gag aag ggg 14497 
Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 
335 " 340 345 350 

gtt agg gee tgg acc ctg etc acc act ccg gac gtg acg tgc ggt tea 14545 
Val Arg Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser 
355 360 365 

gag cag gtc tac tgg teg ctg cct gac atg tac gtg gac cct gtg acg 14593 
Glu Gin Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr 
370 375 380 

ttt cgc tec acg cag caa gtt age aac tac cca gtg gtg gga gcg gag 14641 
Phe Arg Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu 
385 390 395 



14689 



14737 



14785 



14833 



14881 



cgt cgt egg acc tgt ccc tac gtc tac aaa gec ttg gga ate gtg gec 14929 
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly He Val Ala 
480 485 490 

ccg cgc gtc ctg teg agt cgc act ttc tag atgtccatcc tcatctctcc 14979 
Pro Arg Val Leu Ser Ser Arg Thr Phe 
495 500 

cagcaacaat accggttggg gtctgggcgt gaccaaaatg tacggaggcg ccaaacgacg 15039 

gtccccacaa catcccgtgc gagtgegegg gcactttaga gccccatggg ggtcgcacac 15099 

gcgcgggcgc accggccgaa ccaccgtcga cgaogtgatc gatagcgtgg tggccgacgc 15159 
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ccgcaactac cagcccgctc gatccacggt ggacgaagtc atcgacggcg tggtggccga 15219 

cgccagggcc tacgcccgca gaaagtctcg tctgcgccgc cgccgttcgc taaagcgccc 15279 

cacggccgcc atgaaagccg ctcgctctct gctgcgtcgc gcacgtatcg tgggtcgccg 15339 

cgccgccaga cgcgcagccg ccaacgccgc cgccggccga gtgcgccgcc gggccgccca 15399 

gcaggccgcc gccgccatct ccagtctatc cgccccccga cgcgggaatg tgtactgggt 15459 

cagggactcg gccaccggcg tgcgagttcc cgtgagaacc cgtcctcctc gtccctgaat 15519 

aaaaagttct aagcccaatc ggtgttccgt tgtgtgttca gctcgtcatg accaaacgca 15579 

agtttaaaga ggagctgctg caacfcgctgg tccccgaaat ctatgcgccg gcgccggacg 15639 

tgaaaccgcg tcgcgtgaaa cgcgtgaaga agcaggaaaa gctagagaca aaagaggagg 15699 

cggtggcgtt gggagacggg gaggtggagt ttgtgcgctc gttcgcgccg cgtcggcgag 15759 

tgaattggaa ggggcgcaag gtgcaacggg tgctgcgtcc cggcacggtg gtgtctttca 15619 

ccccgggtga aaaatccgcc tggaagggca taaagcgcgt gtacgatgag gtgtacgggg 15879 

acgaagacat tctggagcag gcgctggata gaagcgggga gtttgcttac ggcaagaggg 15939 

cgaggacggg cgagatcgcc atcccgctgg acacttccaa ccccaccccc agtctgaaac 15999 

ccgtgacgct gcaacaggtg ttgccggtga gcgccccctc gcgacgcggc ataaaacgcg 16059 

agggcggcga gctgcagccc accatgcagc tcctggttcc caagaggcag aaactagagg 16119 

acgtactgga catgataaaa atggagcccg acgtgcagcc cgatattaaa atccgtccca 16179 

tcaaagaagt ggcgccggga atgggcgtgc agaccgtgga catccagatt cccatgacca 16239 

gcgccgcaca ggcggtagag gccatgcaga ccgacgtggg gatgatgacg gacctgcccg 16299 

cagctgctgc cgccgtggcc agcgccgcga cgcaaacgga agccggcatg cagaccgacc 16359 

cgtggacgga ggcgcccgtg cagccggcca gaagacgcgt cagacggacg tacggccccg 16419 

tttctggcat aatgccggag tacgcgctgc atccttccat catccccacc cccggctacc 16479 

gggggcgcac ctaccgtccg cgacgcagca ccactcgccg ccgtcgccgc acggcacgag 16539 

tcgccaccgc cagagtgaga cgcgtaacga cacgtcgcgg ccgccgcttg accctgcccg 16599 

tggtgcgcta ccatcccagc atfcctttaaa aaaccgctcc tacgttgcag atgggcaagc 16659 

ttacttgtcg actccgtatg gccgtgcccg gctaccgagg aagatcccgc cgacgacgga 16719 

ctttgggagg cagcggtttg cgccgccgtc gggcggttca ccggcgcctc aagggaggca 16779 

ttctgccggc cctgatcccc ataatcgccg cagccatcgg ggccattccc ggaatcgcca 16839 
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gcgtagcggt gcaggctagc cagcgccact gattttacta accctgtcgg tcgcgccgtc 16899 

tctttcggca gactcaacgc ccagcatgga agacatcaat ttctcctctc tggccccgcg 16959 

gcacggcacg cggccgtata tggggacgtg gagcgagatc ggcacgaacc agatgaacgg 17019 

gggcgctttc aattggagcg gtgtgtggag cggcttgaaa aatttcggtt ccactctgaa 17079 

aacttacggc aaccgggtgt ggaactccag cacggggcag atgctgaggg acaagctaaa 17139 

ggacacgcag tttcagcaaa aggtggtgga cggcatcgct tcgggcctca acggcgccgt 17199 

cgacctggcc aaccaggcca ttcaaaagga aattaacagc cgcctggagc cgcggccgca 17259 

ggtggaggag aacctgcccc ctctggaggc gctgcccccc aagggagaga agcgcccgcg 17319 

gcccgacatg gaggagacgc tagttactaa gagcgaggag ccgccatcat acgaggaggc 17379 

ggtgggtagc tcgcagctgc cgtccctcac gctgaagccc accacctatc ccatgaccaa 17439 

gcccatcgcc tccatggcgc gccccgtggg agtcgacccg cccatcgacg cggtggccac 17499 

tttggacctg ccgcgccccg aacccggcaa ccgcgtgcct cccgtcccca tcgctccgcc 17559 

ggtttctcgc cccgccatcc gccccgtcgc cgtggccact ccccgctatc cgagccgcaa 17619 

cgccaactgg cagaccaccc tcaacagtat tgtcggactg ggggtgaagt ctctgaagcg 17679 

ccgtcgctgt ttttaaagca caatttatta aacgagtagc cctgtcttaa tccatcgttg 17739 

tatgtgtgcc tatatcacgc gttcagagcc tgaccgtccg tcaag atg gcc act ccg 17796 
y " * Met Ala Thr Pro 

505 

teg atg atg ccg cag tgg teg tac atg cac ate gcc ggg cag gac gcc 17844 
Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala Gly Gin Asp Ala 
510 515 520 

teg gag tac ctg age ccg ggt ctg gtg cag ttt gcc cgt gcg a eg gaa 17892 
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala Arg Ala Thr Glu 
525 530 535 

ace tac ttc tea ctg ggc aac aag ttc agg aac ccc acc gtg gcg ccc 
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro 
540 545 550 555 

acc cac gac gtc acc acc gat egg tec cag cga ctg aca ate cgc ttc 
Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu Thr He Arg Phe 
560 565 570 

gtc ccc gtg gac aag gaa gac acc get tac tec tac aaa acc cgc ttc 
Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr Lys Thr Arg Phe 
575 580 585 

acg ctg gcc gtg ggc gac aac egg gtg eta gac atg gcc agt acc tac 
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 
590 * 595 600 



17940 
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ttt gac ate cgc ggc gtg ate gac cgc gga cct age ttc aag cct tac 
Phe Asp He Arg Gly Val He Asp Arg Gly Pro Ser Phe Lys Pro Tyr 
605 ^ 610 615 

tec ggc acg get tac aac tea ctg get ccc aaa ggg gcg ccc aac aac 
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Asn 
620 625 630 635 

age caa tgg aac gcc aca gat aac ggg aac aag cca gtg tgt ttt get 
Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro Val Cys Phe Ala 
640 645 650 

cag gca get ttt ata ggt caa age att aca aaa gac gga gtg caa ata 
Gin Ala Ala Phe He Gly Glji Ser He Thr Lys Asp Gly Val Gin He 
655 660 665 

cag aac tea gaa aat caa cag get get gcc gac aaa act tac caa cca 
Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys Thr Tyr Gin Pro 
670 675 680 

gag cct caa att gga gtt tec acc tgg gat acc aac gtt ace agt aac 
Glu Pro Gin He Gly Val Ser Thr Trp Asp Thr Asn Val Thr Ser Asn 
685 690 695 

get gcc gga cga gtg tta aaa gcc acc act ccc atg ctg cca tgt tac 
Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met Leu Pro Cys Tyr 
700 705 710 715 

ggt tea tat gcc aat ccc act aat cca aac ggg ggt cag gca aaa aca 
Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly Gin Ala Lys Thr 
720 725 730 

gaa gga gac att teg eta aac ttt ttc aca aca act gcg gca gca gac 
Glu Gly Asp He Ser Leu Asn Phe Phe Thr Thr Thr Ala Ala Ala Asp 
735 740 745 

aat aat ccc aaa gtg gtt ctt tac age gaa gat gta aac ctt caa gcc 
Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Leu Gin Ala 
750 755 760 

ccc gat act cac tta gta tat aag cca acg gtg gga gaa aac gtt ate 
Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly Glu Asn Val He 
765 770 775 

gcc gca gaa gcc ctg eta acg cag cag gcg tgt ccc aac aga gca aac 
Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro Asn Arg Ala Asn 
780 785 790 795 

tac ata ggt ttc cga gat aac ttt ate ggt tta atg tat tat aac age 
Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met Tyr Tyr Asn Ser 
800 805 810 

aca ggg aac atg gga gtt ctg gca ggt cag gcc teg cag tta aac gca 
Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu Asn Ala 
815 ' 820 825 
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gtt gta gac ctg caa gat cga aac acg gaa ctg tec tat cag eta atg 
Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met 
830 835 840 

eta gat get ctg ggt gac aga act cga tat ttc tea atg tgg aat cag 
Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp. Asn Gin 
845 850 855 

gec gtg gac age tac gat cca gac gtt agg att ate gag aac cat ggg 
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg lie He Glu Asn His Gly 
860 865 870 875 



18804 



18852 



18900 



gtg gaa gac gag ctg ccc aat tac tgt ttt cca etc cca ggc atg ggt 18948 
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Pro Gly Met Gly 
880 885 890 

att ttt aac tec tac aag ggg gta aaa cca caa aat ggc ggt aat ggt 
He Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn Gly Gly Asn Gly 
895 900 905 

aac tgg gaa gca aac ggg gac eta tea aat gec aat gag ate get tta 
Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn Glu He Ala Leu 
910 915 920 

gga aac att ttt gee atg gaa att aac etc cac gca aac ctg tgg cgc 
Gly Asn He Phe Ala Met Glu He Asn Leu His Ala Asn Leu Trp Arg 
925 930 935 

age ttc ttg tac age aat gtg gcg ctg tac ctg cca gac age tat aaa 
Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys 
940 945 950 955 

ttc act ccc get aac ate act ctg ccc gec aac caa aac ace tac gag 
Phe Thr Pro Ala Asn He Thr Leu Pro Ala Asn Gin Asn Thr Tyr Glu 
960 965 970 

tat ate aac ggg cgc gtc act tct cca acc ctg gtg gac ace ttt gtt 
Tyr He Asn Gly Arg Val Thr Ser Pro Thr Leu Val Asp Thr Phe Val 
975 980 985 

aac att gga gec cga tgg teg ccg gat ccc atg gac aac gtc aac ccc 19284 
Asn He Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro 
990 995 1000 

ttt aac cat cac egg aac .gcg ggc etc cgt tac cgc tec atg ctg 19329 
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 
1005 1010 1015 

ctg gga aat gga cgc gtg gtg cct ttc cac ata caa gtg ccg caa 19374 
Leu Gly Asn Gly Arg Val Val Pro Phe His He Gin Val Pro Gin 
1020 1025 1030 

aaa ttt ttc gcg att aag aac etc ctg ctt ttg ccc ggc tec tac 19419 
Lys Phe Phe Ala He Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 
1035 • 1040 1045 
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act tac gag tgg age ttc aga aaa gac gtg aac atg 
Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met 
1050 1055 1060 



att ctg cag 
lie Leu Gin 



age acc ctg ggc aat gat ctt cga gtg gac ggg gec age gtc cgc 
Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg 
1065 1070 1075 

att gac age gtc aac ttg tac gec aac ttt ttc ccc atg gcg cac 
He Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His 
1080 1085 1090 

aac acc get tct acc ttg gaa gec atg ctg cga aac gac acc aac 
Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 
1095 1100 H05 



gac cag teg ttt aac gac tac 
Asp Gin Ser Phe Asn Asp Tyr 
1110 1H5 



etc age gcg gec aac atg ctt tat 
Leu Ser Ala Ala Asn Met Leu Tyr 
1120 



ccc att ccg gec aac gec acc aac gtt ccc att tec 
Pro He Pro Ala Asn Ala Thr Asn Val Pro He Ser 
H25 H30 H35 



att ccc tec 
He Pro Ser 



cgc aac tgg gcg gee ttc egg 
Arg Asn Trp Ala Ala Phe Arg 
H40 1145 



gga tgg age ttc acc cgc ctt aaa 
Gly Trp Ser Phe Thr Arg Leu Lys 
1150 



gec aag 
Ala Lys 
1155 

gtg tac 
Val Tyr 
1170 



gaa acg cct tec ttg 
Glu Thr Pro Ser Leu 
1160 

tea ggc acc att cct 
Ser Gly Thr lie Pro 
1175 



ggc tec ggc ttt gac 
Gly Ser Gly Phe Asp 
1165 



ccc tac ttt 
Pro Tyr Phe 



tac ctg gac ggc age ttt tac etc 
Tyr Leu Asp Gly Ser Phe Tyr Leu 
1180 



aac cac act ttc aaa cgt ctg 
Asn His Thr Phe Lys Arg Leu 
1185 H90 

agt tgg ccg ggc aac gac cgc 
Ser Trp Pro Gly Asn Asp Arg 
1200 1205 

att aag cgc att gtg gac ggg 
He Lys Arg He Val Asp Gly 
1215 1220 



tec ate atg ttc gat 
Ser He Met Phe Asp 
1195 



tct tec gta 
Ser Ser Val 



etc ctg acg ccg aac gag ttc gaa 
Leu Leu Thr Pro Asn Glu Phe Glu 
1210 

gaa ggc tac aac gtg get caa agt 
Glu Gly Tyr Asn Val Ala Gin Ser 
1225 



aac atg acc aaa gac tgg ttt tta att caa atg etc age cac tac 
Asn Met Thr Lys Asp Trp Phe Leu He Gin Met Leu Ser His Tyr 
1230 1235 1240 



aac ate 
Asn He 
1245 



ggc tac caa ggc ttc tat gtt ccc gag ggc 
Glv Tyr Gin Gly Phe Tyr Val Pro Glu Gly 
1250 1255 



tac aag gat 
Tyr Lys Asp 



19464 



19509 



19554 



19599 



19644 



19689 



19734 



19779 



19824 



19869 



19914 



19959 



20004 



20049 
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egg atg tat tct ttc ttc cga aac ttt cag ccc atg age 
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser 
1260 1265 1270 

gtg ccg gat ccc acc get gee ggc tat caa gec gtt ccc 
Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro 
12 75 1280 1285 

aga caa cac aac aac teg ggc ttt gtg ggg tac atg ggc 
Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly 
12 90 1295 1300 

atg cgc gaa gga cag cca tac ccg gec aac tac ccc tat 
Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr 
1305 1310 1315 

ate ggc get acc gec gtc ccc gee att acc cag aaa aag 
He Gly Ala Thr Ala Val Pro Ala He Thr Gin Lys Lys 
1320 1325 1330 

tgc gac cgc gtc atg tgg cgc ata cct ttt tec age aac 
Cys Asp Arg Val Met Trp Arg He Pro Phe Ser Ser Asn 
1335 1340 1345 

tea atg ggg gec ctg acc gac etc gga cag aac atg ctt 
Ser Met Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu 
1350 1355 1360 

aac tec gec cat gee ctg gat atg act ttt gag gtg gac 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp 
1365 1370 1375 

aac gag ccc acg ttg ctg tac atg ctt ttt gag gtg ttc 
Asn Glu Pro Thr Leu Leu Tyr Met Leu Phe . Glu Val Phe 
1380 1385 1390 

gtc aga gtg cac cag ccg cac cgc ggt att ate gag gec 
Val Arg Val His Gin Pro His Arg Gly He He Glu Ala 
1395 1400 1405 

ctg cgc acc ccc ttc tct gcg ggc aat gec acc aca taa 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1410 1415 1420 

tagctggttt ttaccccaga tcccatgggc tecaeggaag aegaactgeg 

cgagacctgg gctgcggacc ctacttcctg ggcacctttg acaageggtt 

gtgtctcctc gcaaactcgc gtgegegate gtgaataccg ccggccgaga 

gagcattggc tagctctggg ctggaacccc cgctcgtcca cgtttttcct 

tttggctttt cagaccaacg cttgaagcag atetatgeat ttgaatatga 

aagcgaagcg cgctggcctc ctccgccgat cactgtctaa ccctggtaaa 

aeggttcagg gccctcacag cgccgcctgt ggcctttttt gttgcatgtt 



cgc cag 
Arg Gin 



ctg ccc 
Leu Pro 



ccg acc 
Pro Thr 



ccc ctg 
Pro Leu 



ttt ttg 
Phe Leu 



ttt atg 
Phe Met 



tac get 
Tyr Ala 



ccc atg 
Pro Met 



gac gtg 
Asp Val 



gtg tac 
Val Tyr 



20094 



20139 



20184 



20229 



20274 



20319 



20364 



20409 



20454 



20499 



gccgctgaac 20548 



ggccattgtg 
tcccgggttc 
gaeeggagga 
gttcgacccc 
gggtctactc 
gagcactcag 
tttgcacgcc 



20608 
20668 
20728 
20788 
20848 
20908 
20968 
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tttgtgaact ggccggacac ccccatggaa aacaacccca ccatggacct cctgactggc 21028 
gttcccaact ccatgctcca aagccccagc gtgcagacca ccctcctcca aaaccagaaa 21088 
aatctgtacg cctttctgca caagcactct ccctactttc gccgocatcg ggaacaaata 21148 
gaaaatgcaa ccgcgtttaa caaaactctg taacgtttaa taaatgaact ttttattgaa 21208 
ctggaaaacg ggtttgtgat ttttaaaaat caaaggggtt gagctggaca tccatgtggg 21268 
aggccggaag ggtggtgttc ttgtactggt acttgggcag ccacttaaac tctggaatca 21328 
caaacttggg cagcggtatt tctgggaagt tgtcgtgcca cagctggcgg gtcagctgaa 21388 
gtgcctgcag aacatcgggg gcggagatct tgaagtcgca gtttatctgg ttcacggcac 21448 
gcgcgttgcg gtacatggga ttggcacact gaaacaccag caggctggga ttcttgatgc 21508 
tagccagggc cacggcgtcg gtcacgtcac cggtgtcttc tatgttggac agcgaaaaag 21568 
gcgtgacttt gcaaagctgg cgtcccgcgc gaggcacgca atctcccagg tagttgcact 21628 
cacagcggat gggcagaaga agatgcttgt ggccgcgggt catgtaggga taggccgctg 21688 
ccataaaagc ttcgatctgc ctgaaagcct gcttggcctt gtgcccttcg gtataaaaaa 21748 
caccgcagga cttgttggaa aaggtattac tggcgcaagc ggcatcgtga aagcaagcgc 21808 
gtgcgtcttc gtttcgtaac tgcaccacgc tgcggcccca ccggttctga atcaccttgg 21868 
ccctgccggg gttttccttg agagcgcgct ggccggcttc gctgcccaca tccatttcca 21928 
cgacatgctc cttgttaatc atggccagac cgtggaggca gcgcagctcc tcgtcatcgt 21988 
cggtgcagtg atgctcccac acgacgcagc cagtgggctc ccacttgggc ttggaggcct 22048 
cggcaatgcc agaatacagg agaacgtagt ggtgcagaaa acgtcccatc atggtgccaa 22108 
aggttttctg gctgctgaag gtcatcgggc agtacctcca gtcctcgtta agccaagtgt 22168 
tgcagatctt cctgaagacc gtgtactgat cgggcataaa -gtggaactca ttgcgctcgg 22228 
tcttgtcgat cttatacttt tccatcagac tatgcataat ctccatgccc ttttcccagg 22288 
cgcaaacaat cttggtgcta cacgggttag gtatggccaa agtggttggo ctctgaggcg 22348 
gcgcttgttc ttcctcttga gccctctccc gactgacggg ggttgaaaga gggtgcccct 22408 
tggggaacgg cttgaacacg gtctggcccg aggcgtcccg aagaatctgc atcgggggat 22468 
tgctggccgt catggcgatg atctgacccc ggggctcctc cacttcgtcc tcctcgggac 22528 
tttcctcgtg cttttcgggg gacggtacgg gagtaggggg aagagcgcgg cgcgccttct 22588 
tcttgggcgg cagttccgga gcctgctctt gacgactggc cattgtcttc tcctaggcaa 22648 
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gaaaaacaag atggaagact ctttctcctc ctcctcgtca acgtcagaaa gcgagtcttc 22708 
caccttaagc gccgagaact cccagcgcat agaatccgat gtgggctacg agactccccc 22768 
cgcgaacttt tcgccgcccc ccataaacac taacgggtgg acggactacc tggccctagg 22828 
agacgtactg ctgaagcaca tcaggcggca gagcgttatc gtgcaagatg ctctcaccga 22888 
gcgactcgcg gttccgctgg aagtggcgga acttagcgcc gcctacgagc gaaccctctt 22948 
ctccccaaag actcccccca agaggcaggc taacggcacc tgcgagccta accctcgact 23008 
caacttctac cctgcctttg ccgtgccaga ggtactggct acgtaccaca tttttttcca 23068 
aaaccacaaa atccctctct cgtgccgcgc caaccgcacc aaagccgatc gcgtgctgcg 23128 
actggaggaa ggggctcgca tacetgagat tgcgtgtctg gaggaagtcc caaaaatctt 23188 
tgaaggtctg ggccgcgacg aaaagcgagc agcaaacgct ctggaagaga acgcagagag 23248 
tcacaacagc gccttggtag aactcgaggg cgacaacgcc agactggccg tcctcaaacg 23308 
gtccatagaa gtcacgcact tcgcctaccc cgccgttaac ctccctccaa aagttatgac 23368 
agcggtcatg gactcgctgc tcataaagcg cgctcagccc ttagacccag agcacgaaaa 23428 
caacagtgac gaaggaaaac cggtggtttc tgatgaggag ttgagcaagt ggctgtcctc 23488 
caacgacccc gccacgttgg aggaacgaag aaaaaccatg atggccgtgg tgctagttac 23548 
cgtgcaatta gaatgtctgc agaggttctt ttcccaccca gagaccctga gaaaagtgga 23608 
ggaaacgctg cactacacat ttaggcacgg ctacgtgaag caagcctgca agatttccaa 23668 
cgtagaactt agcaacctca tctcctacct ggggatcttg cacgaaaacc gcctcggaca 23728 
aaacgtgctg cacagcacac tgaaaggaga agcccgccga gactatgtgc gagactgcgt 23788 
gttcctagcg ctagtgtaca cctggcagag cggaatggga gtctggcagc agtgcctgga 23848 
ggacgaaaac ctcaaagagc ttgaaaagct gctggtgcgc tccagaaggg cactgtggac 23908 
cagttttgac gagcgcaccg ccgcgcgaga cctagctgat attatttttc ctcccaagct 23968 
ggtgcagact ctccgggaag gactgccaga ttttatgagt caaagcatct tgcaaaactt 24028 
ccgctctttc atcttggaac gctcgggaat cttgcccgcc actagctgcg ccctacccac 24088 
agattttgtg cbtctccact accgcgaatg cccaccgccg ctgtggccgt acacttactt 24148 
gcttaaactg gccaactttc taatgttcca ctctgacctg gcagaagacg ttagcggcga 24208 
ggggctgcta gaatgccact gccgctgcaa cctgtgcacc ccccaccgct ctctagtatg 24268 
caacactccc ctgctcaatg agacccagat catcggtacc tttgaaatcc agggaccctc 24328 
cgacgcggaa aacggcaagc aggggtctgg gctaaaactc acagccggac tgtggacctc 24388 
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cgcctacttg cgcaaatttg tacoagaaga ctatcacgcc caccaaatta aattttacga 24448 
aaaccaatca aaaccaccca aaagcgagtt aacggcttgc gtoattaogc agagcagcat 24508 
agttgggcag ttgcaagcca ttaacaaagc gcggcaagag tttctcctaa aaaaaggaaa 24568 
aggggtctac ttggaccccc agaccggcga ggaactcaac ggaccctcct cagtcgcagg 24628 
ttgtgtgccc catgccgccc aaaaagaaca cctcgcagtg gaacatgcca gagacggagg 24688 
aagaggagtg gagcagtgtg agcaacagcg aaacggagga agagccgtgg cccgaggggt 24748 
gcaacgggga agaggacacg gagggacggc gaagtcttcg ccgaagaact ctcgccgctg 24808 
cccccgaagt cccagccggc cgcctcggcc caagatcccg cacacacccg tagatgggat 24868 
agcaagacca aaaagccggg taagagaaac gctcgccccc gccagggcta ccgctcgtgg 24928 
agaaagcaca aaaactgcat cttatcgtgc ttgctccagt gcggcggaga cgtttcgttc 24988 
acccgtagat acttgctttt taacaaaggg gtggccgtcc cccgtaacgt cctccactac 25048 
taccgtcact cttacagctc cgaagcggac ggctaagaaa acgcagcagt tgccggcggg 25108 
aggactgcgt ctcagcgccc gagaaccccc agccaccagg gagctocgaa accgcatatt 25168 
tcccaccctc tacgctatct ttcagcaaag ccgggggcag cagcaagaac tgaaaataaa 25228 
aaaccgcacg ctgaggtcgc ttacccgaag ctgcctctat cacaagagcg aagagcagct 25288 
gcagcgaacc ctggaggacg cagaagcgct gttccagaag tactgcgcga ccaccctaaa 25348 
taactaaaaa agcccgcgcg cgggacttca aaccgtctga cgtcaccagc cgcgcgccaa 25408 
aatgagcaaa gagattccca cgccttacat gtggagttac cagccgcaga tgggattagc 25468 
cgccggcgcc gcccaggatt actccacgaa aatgaactgg ctcagcgccg ggccccacat 25528 
gatttcccgc gtaaacgaca ttcgcgccca ccgcaatcag ctattgttag aacaggctgc 25588 
tctgaccgcc acgccccgta ataacctgaa ccctcccagc tggccagctg ccctggtgta 25648 
ccaggaaacg cctccaccca ccagcgtact tttgccccgt gacgcccagg cggaagtcca 25708 
gatgactaac gcgggcgcgc aattagcggg cggatcccgg tttcggtaca gagttcacgg 25768 
cgccgcaccc tatagcccag gtataaagag gctgatcatt cgaggcagag gtgtccagct 25828 
caacgacgag acagtgagct cttcgcttgg tctacgacca gacggagtgt tccagctcgc 25888 
gggctcgggc cgctcttcgt tcacgcctcg ccaggcatac ctgactctgc agagctctgc 25948 
ctctcagcct cgctcgggag gaatcggacc ccttcagttt gtggaggagt ttgtgccctc 26008 
ggtctacttt cagcctttct ccggatcgcc cggccagtac ccggacgagt tcatccccaa 26068 
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cttcgacgcg gtgagtgact ctgtggacgg ttatgactga tgtcgagccc gcttcagtgc 26128 
tagtggaaca agcgcggctc aatcacctgg ttogttgccg ccgccgctgc tgcgtggctc 26188 
gcgacttgag cttagctctc aagtttgtaa aaaacccgtc cgaaaccggg agcgctgtgc 26248 
acgggttgga gctagtgggt cctgagaagg ccaccatcca cgttctcaga aactttgtgg 26308 
aaaaacccat tttggttaaa cgagatcagg ggccttttgt aatcagctta ctctgcacct 26368 
gtaaccatgt tgaccttcac gactatttta tggatcattt gtgcgctgaa ttcaataagt 26428 
aaagcgaatt cttaccaaga ttatgatgtc catgactgtt cctcgccact atacgatgtt 26488 
gtgccagtaa actctcttgt cgacatctat ctgaactgtt ccttttggtc cgcacagctt 26548 
acttggtact acggtgacac cgtcctttct ggctcactgg gcagctcaca cggaataaca 26608 
cttcacctct tttcgccgtt tcgatacgga aactacagct gtcgtgccgg tacotgcctc 26668 
cacgttttca atcttcagcc ctgtccaccg accaaacttg tatttgtcga ctctaagcac 26728 
ttacagctca actgcagcat tctaggcccc agtatcttgt ggacatacaa taaaatcagg 26788 
ttggtggaat ttgtctacta cccacccagc gcccgcggtt ttggggaaat tcctttccag 26848 
atctactaca actatcttgc cacacattat gcaagtcaac agcaactaaa cttgcaagca 26908 
cccttcacgc caggagagta ctcctgtcac gtaggctcct gcacagaaac ttttattctc 26968 
ttcaacagat cttctgccat tgaacgcttc actactaact actttagaaa ccaagttgtg 27028 
cttttcactg acgaaacccc taacgtcacc otggactgtg catgtttttc tcatgacacc 27088 
gtaacttgga ctcttaacaa tactctctgg ctcgcgttcg ataaccaaag cttgattgtt 27148 
aaaaattttg atttaacctt tactaaaccc tctcctcgog aaatagttat ctttgctcct 27208 
tttaatccaa aaactacctt agcctgtcag gttttgttta agccttgcca aacaaacttt 27268 
aagtttgttt atttgcctcc gcaatctgtc aaactcatag aaaaatacaa caaagcgccc 27328 
gtcttggctc ctaaaacctt ctaccactgg ctaacctaca cggggctgtt tgcactaatt 27388 
gtttttttcc taattaacat ttttatatgt ttcttgcctt cctccttctt ttcgcgaaca 27448 
ccgttgccgc agaaagacct ctccttatta ctgtagcgct tgctatacaa aaccaagagt 27508 
ggtcaaccgt gctctcaatc tattttcaat ttttcatttt gtccttaata ctttctctta 27568 
ttgtcgttaa caatgatctg gagcattggt ctcgcctttt tttggctgct tagtgcaaaa 27628 
gccactattt ttcacaggta tgtggaagaa ggaactagca ccctctttac gatacctgaa 27688 
acaattaagg cggctgatga agtttcttgg tacaaaggct cgctctcaga cggcaaccac 27748 
tcattctcag gacagaccct ttgcatccaa gaaacttatt ttaaatcaga actacaatac 27808 
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agctgcataa aaaacttttt ccatctctac aacatctcaa aaccctatga gggtatttac 27868 
aatgccaagg tttcagacaa ctccagcaca cggaactttt actttaatct gacagttatt 27928 
aaagcaattt ccattcctat ctgtgagttt agctcccagt ttctttctga aacctactgt 27988 
ttaattacta taaactgcac taaaaatcgc cttcacacca ccataatcta caatcacaca 28048 
caatcacctt gggttttaaa cctaaaattt tctccacaca tgccttcgca atttctcacg 28108 
caagttaccg tctctaacat aagcaagcag tttggctttt actatccttt ccacgaactg 28168 
tgcgaaataa ttgaagccga atatgaacca gactacttta cttacattgc cattggtgta 28228 
atcgttgttt gcctttgctt tgttattggg gggtgtgttt atttgtacat tcagagaaaa 28288 
atattgctct cgctgtgctc ctgcggttac aaagcagaag aaagaattaa aatctctaca 28348 
ctttattaat gttttccaga aatggcaaaa ctaacgctcc tacttttgct tctcacgccg 28408 
gtgacgcttt ttaccatcac tttttctgcc gccgccacac tcgaacctca atgtttgcca 28468 
ccggttgaag tctactttgt ctacgtgttg ctgtgctgcg ttagcgtttg cagtataaca 28528 
tgttttacct ttgtttttct tcagtgcatt gactacttct gggtcagact ctactaccgc 28588 
agacacgcgc ctcagtatca aaatcaacaa attgccagac tactcggtct gccatgattg 28648 
tcttgtattt taccctgatt ttttttcacc ttacttgcgc ttgtgatttt cacttcaotc 28708 
aattttggaa aacgcaatgc ttcgacccgc gcctctccaa cgactggatg atggctcttg 28768 
caattgccac gcttggggcg tttggacttt ttagtggttt tgctttgcat tacaaattta 28828 
agactccatg gacacatggc tttitttcag attttccagt tacacctact ccgccgcctc 28888 
ccccggccat cgacgtgcct caggttccct caccttctcc atctgtctgc agctactttc 28948 
atctgtaatg gccgacctag aatttgacgg agtgcaatct gagcaaaggg ctatacactt 29008 
ccaacgccag tcggaccgcg aacgcaaaaa cagagagctg caaaccatac aaaacaccca 29068 
ccaatgtaaa cgcgggatat tttgtattgt aaaacaagct aagctccact acgagcttct 29128 
atctggcaac gaccacgagc tccaatacgt ggtcgatcag cagcgtcaaa cctgtgtatt 29188 
cttaattgga gtttccccca ttaaagttac tcaaaccaag ggtgaaacca agggaaccat 29248 
aaggtgctca tgtcacctgt cagaatgcct ttacactcta gttaaaaccc tatgtggctt 29308 
acatgattct atccccttta attaaataaa cttactttaa atctgcaatc acttcttcgt 29368 
ccttgttttt gtcgccatcc agcagcacca ccttcccctc ttcccaactt tcatagcata 29428 
ttttccgaaa agaggcgtac tttcgccaca ccttaaaggg aacgtttact tcgctttcaa 29488 
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gctctcccac gattttcatt gcagat atg aaa cgc gcc aaa gtg gaa gaa gga 

Met Lys Arg Ala Lys Val Glu Glu Gly 
1425 

ttt aac ccc gtt tat ccc tat gga tat tct act ccg act gac gtg 
Phe Asn Pro Val Tyr Pro Tyr Gly Tyr Ser Thr Pro Thr Asp Val 
1430 1435 " 1440 

get cct ccc ttt gta gcc tct gac ggt ctt caa gaa aac cca cct 
Ala Pro Pro Phe Val Ala Ser Asp Gly Leu Gin Glu Asn Pro Pro 
1445 1450 1455 



ggg gtc ttg tec eta aaa ata tec aaa cct tta 
Gly Val Leu Ser Leu Lys He Ser Lys Pro Leu 
1460 1465 1470 



act ttt aat gcc 
Thr Phe Asn Ala 



tec aag get eta age ctg get att ggt cca gga tta aaa att caa 
Ser Lys Ala Leu Ser Leu Ala He Gly Pro Gly Leu Lys He Gin 
1475 1480 1485 

gat ggt aaa eta gtg ggg gag gga caa gca att ctt gca aac ctg 
Asp Gly Lys Leu Val Gly Glu Gly Gin Ala He Leu Ala Asn Leu 
1490 " " 1495 1500 

ccg ctt caa ate acc aac aac aca att tea eta cgt ttt ggg aac 
Pro Leu Gin lie Thr Asn Asn Thr lie Ser Leu Arg Phe Gly Asn 
1505 1510- 1515 

aca ctt gcc ttg aat gac aat aat gaa etc caa acc aca eta aaa 
Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr Leu Lys 
1520 1525 1530 

tct tea teg ccc ctt aaa ate aca gac cag act ctg tec ctt aac 
Ser Ser Ser Pro Leu Lys He Thr Asp Gin Thr Leu Ser Leu Asn 
1535 1540 1545 

ata ggg gac age ctt gca ' att aaa gat gac aaa eta gaa age get 
He Gly Asp Ser Leu Ala He Lys Asp Asp Lys Leu Glu Ser Ala 
1550 * 1555 1560 

ctt caa gcg acc etc cca etc tec att age aac aac acc ate age 
Leu Gin Ala Thr Leu Pro Leu Ser lie Ser Asn Asn Thr He Ser 
1565 1570 1575 

etc aac gtg ggc acc gga etc acc ata aat gga aac gtt tta caa 
Leu Asn Val Gly Thr Gly Leu Thr He Asn Gly Asn Val Leu Gin 
1580 ~ 1585 1590 

get gtt ccc tta aat get eta agt ccc eta act att tec aac aat 
Ala Val Pro Leu Asn Ala Leu Ser Pro Leu Thr He Ser Asn Asn 
1595 1600 1605 

aac ate age ctg cgc tat ggc agt tec ctg acg gtg ctt aac aat 
Asn He Ser Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn 
1610 1615 1620 



29541 



29586 



29631 



29676 



29721 



29766 



29811 



29856 



29901 



29946 



29991 



30036 



30081 



30126 
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gaa ctg caa age aac etc aca gtt cac tec cct tta aaa etc aac 30171 

Glu Leu Gin Ser Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn 

1625 1630 1635 

tec aac aac tea att tct etc aac act eta tct ccg ttt aga ate 30216 

Ser Asn Asn Ser He Ser Leu Asn Thr Leu Ser Pro Phe Arg He 

1640 1645 1650 

gag aat ggt ttc etc acg etc tat ttg gga aca aaa tct ggc ttg 30261 

Glu Asn Gly Phe Leu Thr Leu Tyr Leu Gly Thr Lys Ser Gly Leu 

1655 1660 1665 



eta gtt caa aac agt ggc 
Leu Val Gin Asn Ser Gly 
1670 1675 



tta aaa gtt caa gcg ggc tac ggc ctg 
Leu Lys Val Gin Ala Gly Tyr Gly Leu 
1680 



30306 



caa gta aca gac acc aat get etc aca tta aga tat etc get cca 
Gin Val Thr Asp Thr Asn Ala Leu Thr Leu Arg Tyr Leu Ala Pro 
1685 1690 1695 



30351 



ctg acc att cca gac teg ggc tea gaa caa ggc att ctt aaa gta 
Leu Thr He Pro Asp Ser Gly Ser Glu Gin Gly He Leu Lys Val 
1700 * 1705 ' 1710 



30396 



aac act gga cag ggc eta agt gtg aac caa get 
Asn Thr Gly Gin Gly Leu Ser Val Asn Gin Ala 
1715 * 1720 1725 



gga gcg ctt gaa 
Gly Ala Leu Glu 



30441 



aca tec eta gga ggt gga tta aaa tat get gat aac aaa ata acc 
Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp Asn Lys He Thr 
1730 1735 1740 



304S6 



ttt gat aca gga aac gga ctg aca tta tct gaa aat aaa ctt gca 
Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn Lys Leu Ala 
1745 "* 1750 1755 



30531 



gta get gca ggt agt ggt 
Val Ala Ala Gly Ser Gly 
1760 1765 



eta act ttt aga gat 
Leu Thr Phe Arg Asp 
1770 



ggt gec ttg gta 
Gly Ala Leu Val 



30576 



gec acg gga acc gca ttt acg caa aca ctg tgg act acg get gat 
Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala Asp 
1775 1780 1785 



30621 



ccg tct ccc aac tgc aca att ata cag gac cgc gac aca aaa ttt 
Pro Ser Pro Asn Cys Thr He He Gin Asp Arg Asp Thr Lys Phe 
1790 1795 1800 



30666 



act ttg gcg ctt acc att agt ggg age caa gtg 
Thr Leu Ala Leu Thr He Ser Gly Ser Gin Val 
1805 1810 1815 



ctg ggg acg gtt 
Leu Gly Thr Val 



30711 



tec att att gga gta aaa ggc ccc ctt tea agt age ata ccg tea 
Ser He He Gly Val Lys Gly Pro Leu Ser Ser Ser He Pro Ser 
1820 " 1825 1830 



30756 
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get acc gtt aca gta caa ctt aac ttt gat tec aac gga 
Ala Thr Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly 
1835 1840 1845 

ttg age tec tct tea ctt aaa ggt tac tgg ggg tat cge 
Leu Ser Ser Ser Ser Leu .Lys Gly Tyr Trp Gly Tyr Arg 
1850 1855 I860 

ccc tea att gac cct tac ccc ata att aat gec tta aac 
Pro Ser lie Asp Pro Tyr Pro He He Asn Ala Leu Asn 
1865 1870 1875 

cca aac tea ctg get tat ccc ccg gga caa gaa ate caa 
Pro Asn Ser Leu Ala Tyr Pro Pro Gly Gin Glu He Gin 
1880 1885 1890 

tgt aac atg tac gtt tct act ttt tta cga gga aat cca 
Cys Asn Met Tyr Val Ser Thr Phe Leu Arg Gly Asn Pro 
1895 " 1900 1905 

cca ata gtt tta aac ate act ttt aat aat caa acc age 
Pro He Val Leu Asn He Thr Phe Asn Asn Gin Thr Ser 
1910 1915 1920 

tec att aga ttt aca tgg aca aat tta acc aca gga gaa 
Ser He Arg Phe Thr Trp Thr Asn Leu Thr Thr Gly Glu 
1925 1930 1935 

gca atg ccc cca tgc act ttt tec tac att get gaa caa 
Ala Met Pro Pro Cys Thr Phe Ser Tyr He Ala Glu Gin 
1940 1945 1950 

actatgtaac cctcaccgtt aacccgcctc cgcccttcca ttttatttta 

gatccacctt ttcagcagta aacaattgea tgtcagtagg ggcagtaaaa 

ttaaaatcca cacaggttct tcacaagcta agegaaaate agttacactt 

cgctaacatc ggacaaagac aagcatgagt ccaaagcttc cggttctgga 

gttcattaac agegggagaa acagcttctg gaggattttc catctccatc 

gttccaccat gtccaccgtg gtcatctggg acgagaacga cagttgtcat 

agtcaccggt cgatgacgaa egtacagate tcgaagaatg tcctgtcgcc 

agcactgggc egaaggegaa agcgcccatg tttaacaatg gccagcaccg 

caggcgccta gttcttttag cgcaacagcg catgcgcagc tegctaagae 

aacacagcac agaaccacca gattgttcat gatcccataa gcgtgctgac 

actaacaaat tgtttcacta ttctagcatg aatgtcatat ctgatgttca 

atggcgcccc cttatgtaaa cacttcccac gtacaacacc tcctttggca 

aaccacctcc cgataccaaa tacatctctg attaatagtc gccccgtaca 



gee eta 
Ala Leu 



caa ggt 
Gin Gly 



ttt atg 
Phe Met 



gca aaa 
Ala Lys 



caa aga 
Gin Arg 



ggg ttt 
Gly Phe 



gca ttt 
Ala Phe 



caa taa 
Gin 



taaaccaccc 
cttttgggag 
ataaaaccat 
tcagattttt 
tccttcatca 
acacctcata 
gcctttcggc 
cccgcttcat 
tggegcaaga 
accagcccat 
agtaaattaa 
tctgataatt 
ctacccgatt 



30801 

30846 

30891 

30936 

30981 

31026 

31071 

31116 

31176 
31236 
31296 
31356 
31416 
31476 
31536 
31596 
31656 
31716 
31776 
31836 
31896 
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aaaccaagtt gccaacataa tcccccctgc catacactgc aaagaacctg gacggctaca 31956 

atgacagtgc aaagtccaca cctcgttgcc atggataact gaggaacgcc ttaagtcaat 32016 

agtggcacaa ctaatacaaa catgtaaata gtgtttcaac aagtgccact cgtatgaggt 32076 

gagtatcatg tcccagggaa cgggccactc cataaacact gcaaaaccaa cacatcctac 32136 

catcccccgc acggcactca catcgtgcat ggtgttcata tcacagtccg gaagctgagg 32196 

acaaggaaaa gtctcgggag cattttcata gggcggtagt gggtactcct tgtaggggtt 32256 

cagtcggcac cggtatctcc tcaccttctg ggccataaca cacaagttga gatctgattt 32316 

caaggtactt tctgaatgaa aaccaagtgc tttcccaaca atgtatccga tgtcttcggt 32376 

ccccgcgtcg gtagcgctcc ttgcagtaca cacggaacaa ccactcacgc aggcccagaa 32436 

gacagttttc cgcggacggt gacaagttaa tccccctcag tctcagagcc aatatagttt 32496 

cttccacagt agcataggcc aaacccaacc aggaaacaca agctggcacg tcccgttcaa 32556 

cgggaggaca aggaagcaga ggcagaggca taggcaaagc aacagaattt ttattccaac 32616 

tggtcacgta gcacttcaaa caccaggtca cgtaaatggc agcgatcttg ggtttcctga 32676 

tggaacataa cagcaagatc aaacatgaga cgattctcaa ggtgattaac cacagctgga 32736 

attaaatcct ccacgcgcac atttagaaac accagcaata caaaagcccg gttttctccg 32796 

ggatctatca tagcagcaca gtcatcaatt agtcccaagt aattttcccg tttccaatct 32856 

gttataattt gcagaataat gccctgtaaa tccaagccgg ccatggcgaa aagctcagat 32916 

aatgcacttt ccacgtgcat tcgtaaacac accctcatct tgtcaatcca aaaagtcttc 32976 

ttcttgagaa acctgtagta aattaagaat cgccaggtta ggctcgatgc ctacatcccg 33036 

gagcttcatt ctcagcatgc actgcaaatg atccagcaga tcagaacagc aattagcagc 33096 
cagctcatcc ccggtttcca gttccggagt tcccacggca attatcactc gaaacgtggg 33156 
acaaatcgaa ataacatgag ctcccacgtg agcaaaagcc gtagggccag tgcaataatc 33216 
acagaaccag cggaaaaaag attgcagctc atgtttcaaa aagctctgca gatcaaaatt 33276 
cagctcatgc aaataacaca gtaaagtttg cggtatagta accgaaaacc acacgggtcg 33336 
acgttcaaac atctcggctt acctaaaaaa gaagcacatt tttaaaccac agtcgcttcc 33396 
tgaacaggag gaaatatggt gcggcgtaaa accagacgcg ccaccggatc tccggcagag 33456 
ccctgataat acagccagct gtggttaaac agcaaaacct ttaattcggc aacggttgag 33516 
gtctccacat aatcagcgcc cacaaaaatc ccatctcgaa cttgctcgcg tagggagcta 33576 
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aaatggccag tatagcccca tggcacccga acgctaatct gcaagtatat gagagccacc 33636 

ccattcggcg ggatcacaaa atcagtcgga gaaaacaacg tatacacccc ggactgcaaa 33696 

agctgttcag gcaaacgccc ctgcggtccc tctcggtaca ccagcaaagc ctcgggtaaa 33756 

gcagccatgc caagcgctta ccgtgccaag agcgactcag acgaaaaagt gtactgaggc 33816 

gctcagagca gcggctatat actctacctg tgacgtcaag aaccgaaagt caaaagttca 33876 

cccggcgcgc ccgaaaaaac ccgcgaaaat ccacccaaaa agcccgcgaa aaacacttcc 33936 

gtataaaatt tccgggttac cggcgcgtca ccgccgcgcg acacgcccgc cccgccccgc 33996 

gctcctcccc gaaacccgcc gcgcccactt ccgcgttccc aagacaaagg tcgcgtaact 34056 

ccgcccacct catttgcatg ttaactcggt cgccatcttg cggtgttata ttgatgatg 34115 

<210> 35 

<211> 503 

<212> PRT 

<213> simian adenovirus SV-39 

<400> 35 

Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu Gly Pro 
X 5 10 15 

Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gin Ala Pro 
20 25 30 

Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr Gly Gly 
35 40 45 

Arg Asn Ser lie Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp Thr Thr 
50 55 60 

Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp He Ala Thr Leu Asn 
65 70 75 80 

Tvr Gin Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gin Asn Ser 
85 90 95 

Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gin Thr He Asn Leu Asp Asp 
100 105 HO 

Atq Ser Arg Trp Gly Gly Asp Leu Lys Thr He Leu His Thr Asn Met 
115 120 125 

Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala Lys Leu 
130 135 140 

Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp Val Gin 
145 150 155 160 
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Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu lie Met Thr He Asp Leu 
165 170 175 

Met Asn Asn Ala He He Asp His Tyr Leu Ala Val Ala Arg Gin Gin 
180 185 190 

Gly Val Lys Glu Ser Glu He Gly Val Lys Phe Asp Thr Arg Asn Phe 
195 200 205 

Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly Val Tyr 
210 * 215 220 

Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly Cys Gly 
225 230 235 240 

Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly He Arg Lys 
245 250 255 

Arg Met Pro Phe Gin Glu Gly Phe Gin He Leu Tyr Glu Asp Leu Glu 
260 265 270 

Gly Gly Asn He Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser 
275 280 285 

He Ala Asn Ala Arg Glu Ala Ala He Arg Gly Asp Asn Phe Ala Ala 
290 295 300 

Gin Pro Gin Ala Ala Pro Thr He Lys Pro Val Leu Glu Asp Ser Lys 
305 310 315 320 

Gly Arg Ser Tyr Asn Val He Ala Asn Thr Asn Asn Thr Ala Tyr Arg 
325 330 335 

Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser Glu Gin 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr Phe Arg 
370 375 380 

Ser Thr Gin Gin Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Met 
385 390 . 395 400 

Pro He His Ser Lys Ser Phe Tyr Asn Glu Gin Ala Val Tyr Ser Gin 
405 410 415 

Leu He Arg Gin Thr Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gin He Leu Val Arg Pro Pro Ala Pro Thr He Thr Thr Val 
435 440 445 

Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Gin 
450 455 460 
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Asn Ser lie Arg Gly Val Gin Arg Val Thr He Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly He Val Ala Pro Arg 
485 490 495 

Val Leu Ser Ser Arg Thr Phe 
500 



<210> 36 
<211> 917 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 36 

Met Ala Thr Pro Ser Met Met Pro Gin Trp Ser Tyr Met His He Ala 
15 10 15 

Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 

Arg Ala Thr Glu Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gin Arg Leu 
50 55 60 

Thr He Arg Phe Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Thr Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp He Arg Gly Val He Asp Arg Gly Pro Ser 
100 105 HO 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Asn Ser Gin Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro 
130 135 140 

Val Cys Phe Ala Gin Ala Ala Phe He Gly Gin Ser He Thr Lys Asp 
145 150 155 160 

Gly Val Gin He Gin Asn Ser Glu Asn Gin Gin Ala Ala Ala Asp Lys 
165 170 175 

Thr Tyr Gin Pro Glu Pro .Gin He Gly Val Ser Thr Trp Asp Thr Asn 
180 185 190 

Val Thr Ser Asn Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met 
195 200 205 
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Leu Pro Cys Tyr Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly 
210 * 215 220 

Gin Ala Lys Thr Glu Gly Asp He Ser Leu Asn Phe Phe Thr Thr Thr 
225 230 235 240 

Ala Ala Ala Asp Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val 
245 250 255 

Asn Leu Gin Ala Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly 
260 265 270 

Glu Asn Val He Ala Ala Glu Ala Leu Leu Thr Gin Gin Ala Cys Pro 
275 280 285 

Asn Arg Ala Asn Tyr He Gly Phe Arg Asp Asn Phe He Gly Leu Met 
290 295 300 

Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser 
305 310 315 320 

Gin Leu Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser 
325 330 335 

Tyr Gin Leu Met Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser 
340 345 350 

Met Trp Asn Gin Ala Val Asp Ser Tyr Asp Pro Asp Val Arg He He 
355 360 365 

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 
370 375 380 

Pro Gly Met Gly He Phe Asn Ser Tyr Lys Gly Val Lys Pro Gin Asn 
385 390 395 400 

Gly Gly Asn Gly Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn 
405 410 415 

Glu He Ala Leu Gly Asn He Phe Ala Met Glu He Asn Leu His Ala 
420 425 430 

Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro 
435 440 445 

Asp Ser Tyr Lys Phe Thr Pro Ala Asn He Thr Leu Pro Ala Asn Gin 
450 * 455 460 

Asn Thr Tyr Glu Tyr He Asn Gly Arg Val Thr Ser Pro Thr Leu Val 
465 * 470 475 480 

Asp Thr Phe Val Asn He Gly Ala Arg Trp Ser Pro Asp Pro Met Asp 
485 490 495 

Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
500 505 510 
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Ser Met Leu Leu Gly Asn Gly Arg Val Val Pro Phe His lie Gin Val 
515 520 525 

Pro Gin Lys Phe Phe Ala lie Lys Asn Leu Leu Leu Leu Pro Gly Ser 
530 535 540 

Tyr Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met lie Leu Gin 
545 " 550 555 560 

Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg He 
565 570 575 

Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr 
580 585 590 

Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser 
595 600 605 

Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro He Pro Ala 
610 615 620 

Asn Ala Thr Asn Val Pro He Ser He Pro Ser Arg Asn Trp Ala Ala 
625 630 635 640 

Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Ala Lys Glu Thr Pro Ser 
645 650 655 

Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Thr He Pro 
660 665 670 

Tyr Leu Asp Gly Ser Phe Tyr Leu Asn His Thr Phe Lys Arg Leu Ser 
675 680 685 

He Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 
690 695 700 

Thr Pro Asn Glu Phe Glu He Lys Arg He Val Asp Gly Glu Gly Tyr 
705 710 715 720 

Asn Val Ala Gin Ser Asn Met Thr Lys Asp Trp Phe Leu He Gin Met 
725 730 735 

Leu Ser His Tyr Asn He Gly Tyr Gin Gly Phe Tyr Val Pro Glu Gly 
740 745 750 

Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser 
755 760 765 

Arg Gin Val Pro Asp Pro Thr Ala Ala Gly Tyr Gin Ala Val Pro Leu 
770 775 780 

Pro Arg Gin His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
785 790 795 800 

Met Arg Glu Gly Gin Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu He 
805 810 815 
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Gly Ala Thr Ala Val Pro Ala He Thr Gin Lys Lys Phe Leu Cys Asp 
820 825 830 

Arg Val Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
835 340 845 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
850 855 860 

Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asn Glu Pro Thr Leu 
865 870 875 880 

Leu Tyr Met Leu Phe Glu Val Phe Asp Val Val Arg Val His Gin Pro 
885 890 895 

His Arg Gly He He Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
900 905 910 

Gly Asn Ala Thr Thr 
915 



<210> 37 
<211> 533 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 37 

Met Lys Arg Ala Lys Val Glu Glu Gly Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 

Gly Tyr Ser Thr Pro Thr Asp Val Ala Pro Pro Phe Val Ala Ser Asp 
20 ~ 25 30 

Gly Leu Gin Glu Asn Pro Pro Gly Val Leu Ser Leu Lys He Ser Lys 
35 40 45 

Pro Leu Thr Phe Asn Ala Ser Lys Ala Leu Ser Leu Ala He Gly Pro 
50 55 60 

Gly Leu Lys He Gin Asp Gly Lys Leu Val Gly Glu Gly Gin Ala He 
65 70 75 80 

Leu Ala Asn Leu Pro Leu Gin He Thr Asn Asn Thr He Ser Leu Arg 
85 90 95 

Phe Gly Asn Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gin Thr Thr 
100 105 HO 

Leu Lys Ser Ser Ser Pro Leu Lys He Thr Asp Gin Thr Leu Ser Leu 
115 120 125 

Asn He Gly Asp Ser Leu Ala He Lys Asp Asp Lys Leu Glu Ser Ala 
130 135 140 
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Leu Gin Ala Thr Leu Pro Leu Ser lie Ser Asn Asn Thr lie Ser Leu 
145 150 155 160 

Asn Val Gly Thr Gly Leu Thr He Asn Gly Asn Val Leu Gin Ala Val 
165 170 175 

Pro Leu Asn Ala Leu Ser Pro Leu Thr lie Ser Asn Asn Asn He Ser 
180 185 190 

Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn Glu Leu Gin Ser 
195 ~ 200 205 

Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn Ser Asn Asn Ser He 
210 215 220 

Ser Leu Asn Thr Leu Ser Pro Phe Arg He Glu Asn Gly Phe Leu Thr 
225 230 235 240 

Leu Tyr Leu Gly Thr Lys Ser Gly Leu Leu Val Gin Asn Ser Gly Leu 
245 ■ " 250 255 

Lys Val Gin Ala Gly Tyr Gly Leu Gin Val Thr Asp Thr Asn Ala Leu 
260 265 270 

Thr Leu Arg Tyr Leu Ala Pro Leu Thr He Pro Asp Ser Gly Ser Glu 
275 280 285 

Gin Gly He Leu Lys Val Asn Thr Gly Gin Gly Leu Ser Val Asn Gin 
290 295 300 

Ala Gly Ala Leu Glu Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp 
305 ^ 310 315 320 

Asn Lys He Thr Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn 
325 330 335 

Lys Leu Ala Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala 
340 345 350 

Leu Val Ala Thr Gly Thr Ala Phe Thr Gin Thr Leu Trp Thr Thr Ala 
355 360 365 

Asp Pro Ser Pro Asn Cys Thr He He Gin Asp Arg Asp Thr Lys Phe 
370 375 380 

Thr Leu Ala Leu Thr He Ser Gly Ser Gin Val Leu Gly Thr Val Ser 
385 390 395 400 

He He Gly Val Lys Gly Pro Leu Ser Ser Ser He Pro Ser Ala Thr 
405 410 415 

Val Thr Val Gin Leu Asn Phe Asp Ser Asn Gly Ala Leu Leu Ser Ser 
420 425 430 

Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gin Gly Pro Ser He Asp 
435 440 445 



216 



WO 03/046124 PCT/US02/33645 



Pro Tyr Pro He He Asn Ala Leu Asn Phe Met Pro Asn Ser Leu Ala 
450 455 460 

Tyr Pro Pro Gly Gin Glu He Gin Ala Lys Cys Asn Met Tyr Val Ser 
465 470 475 480 

Thr Phe Leu Arg Gly Asn Pro Gin Arg Pro He Val Leu Asn He Thr 
485 490 495 

Phe Asn Asn Gin Thr Stsr Gly Phe Ser He Arg Phe Thr Trp Thr Asn 
500 505 510 

Leu Thr Thr Gly Glu Ala Phe Ala Met Pro Pro Cys Thr Phe Ser Tyr 
515 520 525 

He Ala Glu Gin Gin 
530 



<210> 


38 


<211> 


50 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


oligomer SV25T 


<400> 


38 



aatttaaata cgtagcgcac tagtcgcgct aagcgcggat atcatttaaa 50 



<210> 


39 


<211> 


49 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


oligomer SV25B 


<400> 


39 



tatttaaatg atatccgcgc ttaagcgcga ctagtgcgct acgtattta 49 
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Groups 23. 61, 99, 135, 168, and 195, claim(s) 3, 8-10, 28 and 29, each drawn to ihc hexon protein of Ad Pan-5, Ad Pan-6, Ad Pan-7, 
Ad SV-i . Ad SV-25 and Ad SV-39, respectively. 

Groups 38. 76, 1 14, 146. 147, 174. 175, and 204, claim(s) 3, 8, 9, 1 1 , 28 and 29, each drawn to the Ad Pan-5 fiber protein, the Ad 
Pan-6 fiber protein, the Ad Pan-7 fiber protein, the Ad SV-l ftberl protein, the Ad SV-l fiber2 protein, the Ad SV-25 fiberl protein, 
the Ad SV-25 fiber2 protein and the Ad SV-39 fiber protein, respectively. 
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24). Each of groups 6-2 10 is directed to the different proteins produced by one of five different adenoviruses. For proteins of one 
adenovirus, these proteins are materially different proteins with different functions, and thus share no special technical feature with one 
another. The proteins of groups 6-210 are materially different compounds from the nucleic acids which encode them. Inventions directed 
to different products are not recognized as sharing a special technical feature under 37 CFR 1.475(b). 
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